Machine learning system and method for determining or inferring user action and intent based on screen image analysis

ABSTRACT

System(s) and method(s) that analyze image data associated with a computing screen operated by a user, and learn the image data (e.g., using pattern recognition, historical information analysis, user implicit and explicit training data, optical character recognition (OCR), video information, 360°/panoramic recordings, and so on) to concurrently glean information regarding multiple states of user interaction (e.g., analyzing data associated with multiple applications open on a desktop, mobile phone or tablet). A machine learning model is trained on analysis of graphical image data associated with screen display to determine or infer user intent. An input component receives image data regarding a screen display associated with user interaction with a computing device. An analysis component employs the model to determine or infer user intent based on the image data analysis; and an action component provisions services to the user as a function of the determined or inferred user intent. In an implementation, a gaming component gamifies interaction with the user in connection with explicitly training the model.

CROSS REFERENCE TO RELATED APPLICATIONS

This patent application claims the benefit of U.S. Provisional Application Ser. No. 62/559,180, filed on Sep. 15, 2017, the entirety of which is incorporated herein by reference. This patent application also claims the benefit of U.S. Provisional Application Ser. No. 62/641,486, filed on Mar. 12, 2018, the entirety of which is incorporated herein by reference.

TECHNICAL FIELD

The subject disclosure relates to machine learning system(s) and/or method(s) for determining or inferring user action and/or user intent based on screen image analysis.

SUMMARY

The following presents a summary to provide a basic understanding of one or more embodiments of the invention. This summary is not intended to identify key or critical elements, or delineate any scope of the particular embodiments or any scope of the claims. Its sole purpose is to present concepts in a simplified form as a prelude to the more detailed description that is presented later. In one or more embodiments described herein, devices, systems, computer-implemented methods, apparatus and/or computer program products are described.

The innovations described herein relate to a system to facilitate determining or inferring user intent and/or user action. Conventional user behavior monitoring and tracking systems are in general limited to analyzing data within a particular application programming interface (API) or web interaction (e.g., a Facebook instance, a web browsing instance, an application instance, and so on). Such systems thus analyze data within a specific instance and context, and so determinations or inferences regarding user behavior or intent are often limited to the particular data vertical analyzed. However, users are often multitasking within a computing environment, and consequently conventional systems are inadequate for learning the larger corpus of information pertaining to a user. For instance, if a user operates a first program (e.g., a word processor) and a second program (e.g., a webpage) simultaneously on a computing device, conventional user behavior tracking systems can monitor the user's actions in only one of the first program or the second program, but not in both. Multiple conventional monitoring systems would have to be implemented to monitor the user's interactions in multiple programs (e.g., a specialized monitoring system dedicated to monitoring the user's actions in each distinct program, and so on). But even then, the recorded actions would not be correlated or otherwise multiplexed into a single timeline/data structure, thereby still failing to holistically track the user's overall computing behavior. One or more embodiments of the subject innovation can address these problems. Specifically, in various embodiments, the subject innovation can analyze image data associated with a computing screen operated by a user, and can learn the image data (e.g., using pattern recognition, historical information analysis, user implicit and explicit training data, optical character recognition (OCR), video information, 360°/panoramic recordings, and so on) and can concurrently glean information regarding multiple states of user interaction (e.g., analyzing data associated with multiple applications open on a desktop, mobile phone, tablet, television, vehicle display/interface, and so on).

Conventional methods of capturing screen interaction do not allow for multi-day or unlimited continuous recording (e.g., that could record years of computing interaction). Instead, conventional methods of capturing screens either work on a limited time basis (e.g., using screen-recording applications designed for that purpose) or capture screens on a periodic basis (e.g., every 3 seconds). In one or more embodiments, a system can allow for such capturing across any number of devices and/or screens. A specially designed capture and compression system can facilitate such types of recording and/or capturing of full interaction data continuously with minimal CPU usage.
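By way of non-limiting illustration, the following sketch shows one way such a low-overhead continuous capture loop could be organized: the hypothetical capture_screen() helper (not part of this disclosure) stands in for a platform screen-capture call, unchanged frames are detected by hashing and skipped, and only changed frames are compressed and appended to the archive.

```python
import hashlib
import time
import zlib


def capture_screen() -> bytes:
    """Hypothetical helper: returns raw pixel bytes of the current display.
    A real implementation would use a platform screen-capture API."""
    raise NotImplementedError


def continuous_capture(archive_path: str, interval_s: float = 3.0) -> None:
    """Append only changed frames, compressed, to keep CPU and storage usage low."""
    last_digest = None
    with open(archive_path, "ab") as archive:
        while True:
            frame = capture_screen()
            digest = hashlib.sha256(frame).digest()
            if digest != last_digest:              # skip frames identical to the previous one
                payload = zlib.compress(frame, level=6)
                header = len(payload).to_bytes(8, "big") + int(time.time()).to_bytes(8, "big")
                archive.write(header + payload)    # record length + timestamp + compressed frame
                last_digest = digest
            time.sleep(interval_s)
```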

More particularly, a machine learning model can be trained on analysis of graphical image data associated with screen display to determine or infer user intent and/or action. An input component can receive image data regarding a screen display associated with user interaction with a computing device. An analysis component can employ the model to determine or infer user intent based on the image data analysis; and an action component can provision services to the user as a function of the determined or inferred user intent. In an implementation, a gaming component can gamify interaction with the user in connection with explicitly training the model. A reward component can be employed to expose functionality associated with the trained model as a function of user training efforts.

In an aspect, an interactive component can share status of training of the model with other entities. For example, friends can expose status of a gaming component which exposes level of training achieved and functionalities available. In yet another aspect, an avatar component can generate an avatar that interacts with the user based in part on the model and the action component. A training component can generate an interface for explicitly training the model by the user. The training component can question the user regarding accuracy of determinations or inferences generated by the model.

The model can employ a recursive learning algorithm, backward propagation of learning across other models, or a continuous learning algorithm. The model can learn the impact of respective images and direct the action component to revise actions as a function of the learned impact. An optimization component can generate inferences, based on the model, regarding potential points of failure, weakness or bottlenecks in connection with taking automated action. A scheduling component can be utilized to schedule activities for the user. The model can identify two or more applications running concurrently on the screen, and can determine or infer user intent regarding the respective applications.

In some embodiments, elements described in connection with the computer-implemented method(s) can be embodied in different forms such as a system, a computer program product, or another form.

DESCRIPTION OF THE DRAWINGS

FIG. 1 illustrates a system in accordance with one or more embodiments described herein.

FIG. 2 illustrates a high-level diagram of an aspect of the innovations in accordance with one or more embodiments described herein.

FIG. 3 illustrates a high-level diagram of an aspect of the innovations in accordance with one or more embodiments described herein.

FIGS. 4-7 illustrate example user interfaces and associated components in accordance with one or more embodiments described herein.

FIG. 8 illustrates an example, non-limiting methodology in accordance with one or more embodiments described herein.

FIGS. 9A-9J illustrate a computing device interaction archival and retrieval tool in accordance with one or more embodiments described herein.

FIG. 10 illustrates a block diagram of an example, non-limiting operating environment in which one or more embodiments described herein can be facilitated.

DETAILED DESCRIPTION

The following detailed description is merely illustrative and is not intended to limit embodiments and/or application or uses of embodiments. Furthermore, there is no intention to be bound by any expressed or implied information presented in the preceding Background or Summary sections, or in the Detailed Description section.

One or more embodiments are now described with reference to the drawings, wherein like reference numerals are used to refer to like elements throughout. In the following description, for purposes of explanation, numerous specific details are set forth in order to provide a more thorough understanding of the one or more embodiments. It is evident, however, in various cases, that the one or more embodiments can be practiced without these specific details.

The innovations described herein relate to a system to facilitate determining or inferring user intent and/or user action. Conventional user behavior monitoring and tracking systems are in general limited to analyzing data within a particular application programming interface (API) or web interaction (e.g., a Facebook instance, a web browsing instance, an application instance, and so on). Such systems thus analyze data within a specific instance and context, and so determinations or inferences regarding user behavior or intent are often limited to the particular data vertical analyzed. However, users are often multitasking within a computing environment, and consequently conventional systems are inadequate for learning the larger corpus of information pertaining to a user. The subject innovation can analyze image data associated with a computing screen operated by a user, and can learn the image data (e.g., using pattern recognition, historical information analysis, user implicit and explicit training data, optical character recognition (OCR), video information, 360°/panoramic recordings, and so on) and can concurrently glean information regarding multiple states of user interaction (e.g., analyzing data associated with multiple applications open on a desktop, mobile phone or tablet). The model can analyze multiple sets of screen image data in a time-series manner. For example, if a user is simultaneously using a word processing program, an email account, and another internet webpage (e.g., such that part of the word processing program, part of the email account, and part of the internet webpage are displayed concurrently on the screen), the subject innovation can analyze the entire screen data over a period of time so as to monitor and/or track user behavior and/or interaction in all three instances (e.g., what document was open and/or what words/sentences were typed in the word processor, what emails were sent/received/read and/or to/from whom in the email account, and what browser was used and/or what links were selected in the internet webpage, and so on), thereby constituting a significant improvement over conventional systems.

FIG. 1 illustrates system 100 that facilitates learning, determining and predicting user intent. A processor 102 can execute computer executable components stored in a memory 104. An input component 106 can receive image data 107 regarding a screen display associated with user interaction with a computing device. For example, the input component can capture images of the computing screen or a subset thereof. Additionally, the input component 106 can also capture any type of extrinsic information (e.g., audio, motion, location, weather, temperature, and so on) to facilitate determining or inferring user state and associated context. In various embodiments, the input component 106 can include (and/or be communicatively connected to) an electroencephalograph (EEG device) (e.g., a wired and/or wireless headset with EEG sensors coupled thereto, and so on) to receive and/or record brainwave data from the user. Moreover, any other bio-sensors can be incorporated to record/measure other physical measurements and/or bio-data (e.g., heartbeat, body surface tension, body temperature, body movements, micromovements and/or twitches, eye tracking, and so on). In such cases, one or more brainwaves of the user (and/or any other bio-data of the user) can be monitored so as to detect emotions, thoughts, and/or intentions while the user interacts with the computing device(s). Such measured bio-data can be correlated in time with on-screen interactions and/or on-screen objects. A model generation component 108 can generate a model 110 that can be trained on data from the input component 106 as well as user provided data (e.g., implicit or explicit training data), and through a machine learning component 112 the model 110 can converge on determining, predicting or inferring user intent with high confidence. After a particular confidence level is achieved, the model can facilitate the system taking automated action (e.g., using a utility-based analysis where the benefit of taking an automated action correctly is weighed against the cost of taking an incorrect action). An analysis component 114 can employ the model 110 to determine or infer user intent based on the image data analysis; and an action component 116 can provision services to the user as a function of the determined or inferred user intent. In an implementation, a gaming component 118 can gamify interaction with the user in connection with explicitly training the model 110. A reward component 120 can be employed to expose functionality associated with the trained model 110 as a function of user training efforts.
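By way of non-limiting illustration, the utility-based analysis mentioned above can be sketched as a simple expected-utility comparison; the benefit and cost values below are illustrative placeholders rather than values prescribed by the system.

```python
def should_act_automatically(confidence: float,
                             benefit_if_correct: float,
                             cost_if_wrong: float) -> bool:
    """Utility-based check: act only when the expected benefit of acting on the
    inferred intent outweighs the expected cost of acting incorrectly."""
    expected_benefit = confidence * benefit_if_correct
    expected_cost = (1.0 - confidence) * cost_if_wrong
    return expected_benefit > expected_cost


# Example: the model is 92% confident about the inferred intent; the benefit and
# cost figures are illustrative placeholders.
print(should_act_automatically(confidence=0.92, benefit_if_correct=1.0, cost_if_wrong=5.0))
```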

In various embodiments, the system (e.g., via the machine learning component 112 and/or the analysis component 114) can analyze the images, screenshots, and/or extrinsic data 107 by employing focus objects and/or focus sections. Focus objects/sections can represent visual portions (e.g., groups of pixels) of a particular on-screen display that have more importance/relevance than other portions of the on-screen display. In some cases, the focus objects/sections can represent changing/transient visual portions of the screen; in other cases, they can represent non-transient visual portions of the screen. Thus, focus objects/sections can enable the system to distinguish between important and unimportant changes in the on-screen display (e.g., changes from one stored image to another among a sequence of stored images, screenshots, and extrinsic data 107, and so on). For example, a digital clock icon is generally displayed in a corner of a computer display screen. Although the displayed time changes every minute (e.g., the pixels in that portion of the screen change to show the new time), such on-screen changes are usually not very important/relevant regarding the interactions/tasks performed by the user on the computing device (e.g., the time constantly changes, but such changes often convey no substantive information regarding what the user was doing on the computing device when those screens were displayed). In contrast, changes made to the body of a word document, a change in the recipient field and/or carbon copy field of an email, and/or a change in website URLs can be very important/relevant regarding the substantive actions that the user was performing. Thus, in some instances, the portion of the screen that displays the clock can be marked as not a focus object/section. Conversely, the portions of the screen displaying email recipient fields, email subject fields, bodies of word processing documents, internet browser fields, and so on can be marked as focus objects/sections (e.g., indicating that such portions are substantively important). Focus objects/sections as described can be implemented via dirty rectangle algorithms known in the art, in conjunction with OCR and other image analysis. In some embodiments, continuous, periodic, and/or regular on-screen changes (e.g., changing of a clock icon every minute, and so on) can be recognized via image analysis and can accordingly be noted as not a focus object/section. In some embodiments, unique, nonperiodic, and/or irregular on-screen changes (e.g., writing/deleting words in a word document/email, and so on) can similarly be recognized via image analysis and can accordingly be noted as focus objects/sections. In still more embodiments, each focus object/section can have a respective time associated with it, which time represents the amount of time throughout a day (e.g., or any other time period) during which the focus object/section was displayed on the screen.
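By way of non-limiting illustration, the following sketch shows one way a dirty-rectangle comparison of the kind described above could flag focus candidates: frames are divided into tiles, per-tile change counts are accumulated, and tiles that change on nearly every frame (such as a clock region) are treated as non-focus. The tile size and the change-rate threshold are illustrative assumptions.

```python
import numpy as np

TILE = 32  # tile edge length in pixels (illustrative choice)


def dirty_tiles(prev: np.ndarray, curr: np.ndarray) -> list[tuple[int, int]]:
    """Return (row, col) indices of tiles whose pixels changed between two frames."""
    changed = []
    rows, cols = curr.shape[0] // TILE, curr.shape[1] // TILE
    for r in range(rows):
        for c in range(cols):
            a = prev[r * TILE:(r + 1) * TILE, c * TILE:(c + 1) * TILE]
            b = curr[r * TILE:(r + 1) * TILE, c * TILE:(c + 1) * TILE]
            if not np.array_equal(a, b):
                changed.append((r, c))
    return changed


def classify_focus(frames: list[np.ndarray], periodic_threshold: float = 0.9) -> dict:
    """Count how often each tile changes across a frame sequence; tiles that change
    in nearly every frame (e.g., a clock region) are marked non-focus."""
    counts: dict[tuple[int, int], int] = {}
    for prev, curr in zip(frames, frames[1:]):
        for tile in dirty_tiles(prev, curr):
            counts[tile] = counts.get(tile, 0) + 1
    transitions = max(len(frames) - 1, 1)
    return {tile: ("non-focus" if n / transitions >= periodic_threshold else "focus-candidate")
            for tile, n in counts.items()}
```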

It is to be appreciated that not all components depicted in FIG. 1 are required for operation of the system 100, and that the system 100 can be implemented with a subset of the components illustrated.

FIG. 2 illustrates an implementation of the system 100 that includes an interactive component 202 that can share status of training of the model 110 with other entities. For example, friends can expose status of a gaming component 204 which exposes level of training achieved and functionalities available. It is to be appreciated that the user of embodiments herein can include entities, groups, machines, components, humans, and so on. The gaming component 204 can provide, in a highly glanceable manner, status of training of the model 110 and functionalities achieved thus far to the user and others. Accordingly, a competitive or incentivized environment can be created within a social network to encourage respective parties to continue training respective models, e.g., to stay ahead of others, achieve new functionalities, and create a model that provides for high-confidence automated action.

In yet another aspect, illustrated in FIG. 3, an avatar component 302 can generate an avatar that interacts with the user based in part on the model 110 and the action component 116. A training component 304 can generate an interface for explicitly training the model 110 by the user. The training component 304 can question the user regarding accuracy of determinations or inferences generated by the model. In various embodiments, the training component 304 can give the user actions to perform and/or fulfill, in order to facilitate training on a known set. By purposely and controllably giving the user tasks to perform, the system can facilitate known/explicit training of itself. For example, the system can present a notification to a user that it is Mother's Day and that the user should send a message to the user's mother. The system can then record (e.g., via the input component 106 and the images/data 107) how the user performs such assigned tasks. Such recorded performances/actions can then be examined and/or analyzed by the system (e.g., via the machine learning component 112 and/or the analysis component 114) so as to learn how to perform such tasks (e.g., sending a message to the user's mother and/or to others) in the future. In various embodiments, this type of purposeful and controlled training of the system can be performed in addition to the observational/passive training of the system discussed above (e.g., applying image analysis and/or OCR to the received/recorded images, screenshots, and/or extrinsic data 107 without prompting the user to perform certain tasks). An optimization component 306 can optimize automated action taken by the action component 116. As described further below, the avatar component 302 can serve as a personal assistant to the user and engage in dialog with the user in connection with training the model, confirming accuracy of predictions, taking automated action or seeking approval, and so on.
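By way of non-limiting illustration, the assigned-task training described above could pair each presented task with the interaction data recorded while the user performs it, yielding labeled examples on a known set; the AssignedTask structure and the record_interaction() helper below are illustrative assumptions rather than components defined by this disclosure.

```python
from dataclasses import dataclass, field
from typing import Any


@dataclass
class AssignedTask:
    prompt: str                                          # e.g., "Send a Mother's Day message"
    recording: list[Any] = field(default_factory=list)   # screen/extrinsic events captured


def record_interaction() -> list[Any]:
    """Hypothetical helper: returns the events captured while the task was active."""
    return []


def run_explicit_training_round(prompts: list[str]) -> list[AssignedTask]:
    """Present each task, capture how the user performs it, and return labeled examples."""
    examples = []
    for prompt in prompts:
        print(f"Training task: {prompt}")   # surface the assigned task to the user
        events = record_interaction()       # capture the user's performance of the task
        examples.append(AssignedTask(prompt=prompt, recording=events))
    return examples
```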

The model 110 can employ a recursive learning algorithm, backward propagation of learning across other models, or a continuous learning algorithm. The model 110 can learn impact and/or level of importance/relevance of respective images (e.g., via focus objects/sections) and direct the action component 116 to revise actions as a function of the learned impact. An optimization component 306 can generate inferences, based on the model, regarding potential points of failure, weakness or bottlenecks in connection with taking automated action.

The model generation component 108 can generate a set of models 110 of influencers. The machine learning component 112 can utilize data received by the input component 106 to score and rank influencers for insights on optimizing operations.

The model(s) 110 or machine learning component 112 can analyze influencer data with historical data utilizing physics-based, big data or data science to refine respective models. Big data can be, for example, operational, geographical, environmental, configuration and other data sets fused together. For example, atmospheric or airline workflow conditions that are measured at various frequencies for an airport can serve as valuable data with reference to data analysis in connection with building model(s).

The model(s) 110 or machine learning component 112 can employ artificial intelligence, physical domain expertise and other statistical methods to score and rank components and generate analytics regarding impact of respective influencers 107 on user action(s). The model(s) 110 or machine learning component 112 can perform recursive learning across unstructured subsets of data. Respective models can comprise a neural network and a Bayesian network and can interact with one another.

The analysis component 114 can employ the model(s) 110 to determine impact of the state of respective influencers, determinations, predictions or inferences. The model(s) 110 can be updated using artificial intelligence or machine learning to improve model fidelity as a function of collected data.

The embodiments of devices described herein can employ artificial intelligence (AI) to facilitate automating one or more features described herein. The components can employ various AI-based schemes for carrying out various embodiments/examples disclosed herein. In order to provide for or aid in the numerous determinations (e.g., determine, ascertain, infer, calculate, predict, prognose, estimate, derive, forecast, detect, and so on) described herein, one or more components can examine an entirety or a subset of data to which it is granted access and can provide for reasoning about or determine states of a system, environment, and so on from a set of observations as captured via events and/or data. Determinations can be employed to identify a specific context or action, and/or can generate a probability distribution over states, for example. The determinations can be probabilistic; that is, the computation of a probability distribution over states of interest based on a consideration of data and events. Determinations can also refer to techniques employed for composing higher-level events from a set of events and/or data.

Such determinations can result in the construction of new events or actions from a set of observed events and/or stored event data, whether or not the events are correlated in close temporal proximity, and whether the events and data come from one or several event and data sources. Components disclosed herein can employ various classification (explicitly trained (e.g., via training data) as well as implicitly trained (e.g., via observing behavior, preferences, historical information, receiving extrinsic information, and so on)) schemes and/or systems (e.g., support vector machines, neural networks, expert systems, Bayesian belief networks, fuzzy logic, data fusion engines, and so on) in connection with performing automatic and/or determined action in connection with the claimed subject matter. Thus, classification schemes and/or systems can be used to automatically learn and perform a number of functions, actions, and/or determinations.

A classifier can map an input attribute vector, z=(z1, z2, z3, z4, …, zn), to a confidence that the input belongs to a class, as by f(z)=confidence(class). Such classification can employ a probabilistic and/or statistical-based analysis (e.g., factoring into the analysis utilities and costs) to determine an action to be automatically performed. A support vector machine (SVM) is an example of a classifier that can be employed. The SVM operates by finding a hyper-surface in the space of possible inputs, where the hyper-surface attempts to split the triggering criteria from the non-triggering events. Intuitively, this makes the classification correct for testing data that is near, but not identical to, training data. Other directed and undirected model classification approaches include, e.g., naïve Bayes, Bayesian networks, decision trees, neural networks, fuzzy logic models, and probabilistic classification models providing different patterns of independence, any of which can be employed. Classification as used herein also is inclusive of statistical regression that is utilized to develop models of priority.
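By way of non-limiting illustration, the following sketch uses scikit-learn's support vector machine to map an attribute vector to a per-class confidence in the manner described above; the toy attribute vectors and labels are illustrative placeholders only.

```python
import numpy as np
from sklearn.svm import SVC

# Toy attribute vectors z = (z1, ..., zn) with class labels (illustrative placeholders).
X = np.array([[0.10, 0.90], [0.20, 0.80], [0.15, 0.95], [0.25, 0.85], [0.05, 0.70],
              [0.90, 0.10], [0.80, 0.30], [0.85, 0.20], [0.95, 0.25], [0.70, 0.15]])
y = np.array(["compose_email"] * 5 + ["browse_web"] * 5)

# probability=True yields per-class confidences, i.e., f(z) = confidence(class).
clf = SVC(kernel="rbf", probability=True, random_state=0)
clf.fit(X, y)

z = np.array([[0.15, 0.85]])            # a new attribute vector to classify
for label, conf in zip(clf.classes_, clf.predict_proba(z)[0]):
    print(f"{label}: {conf:.2f}")
```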

In one or more embodiments, the system 100 can employ icons to help simplify AI learning. As explained above, the system 100 can facilitate its functionality by continuously recording and/or receiving images, screenshots, and/or extrinsic data 107 via the input component 106. Such images, screenshots, and/or extrinsic data 107 can include images of websites, program/application interfaces, other graphical user interfaces (GUIs), and so on. In general, the organization and/or appearance of websites, program/application interfaces, and/or GUIs can be optimized for human use. For example, websites/GUIs can be designed, organized, and/or arranged so as to be visually appealing and/or otherwise easy to use/maneuver for a human. But such design, organization, and/or arrangement can, in some cases, not be ideal to facilitate AI learning via image/screenshot analysis. So, inconspicuous on-screen icons can be used to make the website, program/application interface, and/or GUI more susceptible to AI image analysis. Such icons can be composed of several and/or a few pixels on screen such that they are not noticeably visible to humans (e.g., an icon comprising six pixels having particular colors, settings, intensities, and/or arrangements, and so on). For instance, an icon can be several and/or a few pixels organized as one-dimensional or two-dimensional barcodes, and so on. Such icons can be placed on-screen nearby and/or on important buttons, links, and/or other elements of interest, thereby helping the system 100 to more easily recognize and/or learn those buttons, links, and/or elements. Those of ordinary skill will appreciate that other types and uses of such icons can be incorporated.
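By way of non-limiting illustration, the following sketch shows how a tiny marker such as the six-pixel icon described above might encode a small identifier into pixel intensities and recover it from a captured screenshot region; the 2x3 layout and the two bits per pixel are illustrative assumptions.

```python
import numpy as np


def encode_icon(marker_id: int) -> np.ndarray:
    """Pack a 12-bit identifier into a 2x3 pixel patch, two bits per pixel intensity."""
    assert 0 <= marker_id < 4096
    levels = [(marker_id >> (2 * i)) & 0b11 for i in range(6)]               # six 2-bit values
    intensities = np.array([level * 85 for level in levels], dtype=np.uint8)  # 0, 85, 170, 255
    return intensities.reshape(2, 3)


def decode_icon(patch: np.ndarray) -> int:
    """Recover the identifier from a captured 2x3 patch by rounding to the nearest level."""
    levels = np.rint(patch.astype(float).reshape(-1) / 85).astype(int)
    return sum(int(level) << (2 * i) for i, level in enumerate(levels))


patch = encode_icon(1234)
print(decode_icon(patch))  # -> 1234
```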

In one or more embodiments, such icons can be used as markers and/or identifiers that can provide deep links to particular moments in time, to particular recorded screen images, and/or to particular applications/programs, and so on. For example, if a user is conducting online research for a school/work project, an on-screen icon (as described above) can be added to the screen and/or otherwise associated with that research website, thereby providing a deep link (e.g., akin to a hyperlink) to that particular screen and/or website. So, when the user is later searching through his/her recorded on-screen history and/or personal timeline (as described below), the user can quickly call/summon that recorded screen by activating the icon/deep link. This allows the user to quickly and efficiently search through his/her recorded, on-screen history. Moreover, in various embodiments, the user can reopen the actual website and/or webpage itself (e.g., more than just viewing a recorded, on-screen session) by activating the icon/deep link. As a more specific example, consider a user that is using a search engine (e.g., Google) to search for “mechatronic devices.” Say that the user proceeds to the third page of Google results, where the user sees internet links of interest to him/her regarding “mechatronic devices.” The user can then place an icon on that screen (or the system can automatically place the icon in response to detecting the user's interest in that webpage/screen, or the platform can place such icons as part of its service, and so on), such that the icon provides a deep link to that particular screen in the user's recorded computing history. So, when the user is reviewing his/her recorded computing history, he/she can activate the icon/deep link to quickly navigate to the moment in time when the third Google results page for the “mechatronic devices” search was displayed on his/her screen. Additionally, the user can, in some embodiments, activate the icon/deep link to actually reopen that third Google results webpage, thereby reconnecting the user to that internet webpage and enabling the user to access/select the various links presented on that webpage. Those of skill in the art will appreciate that such deep links can be used to quickly navigate to and/or reopen any type of website, webpage, application section (e.g., a user can activate a deep link to jump to a specific conversation in Slack that happened in the past, including reopening the general conversation/message string so as to see and/or respond to the latest messages, as well as to jump to the actual moment/message that was marked with the deep link even if not the latest message), application programming interface, and so on.
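By way of non-limiting illustration, a deep-link registry of the kind described above could map a marker identifier to both a position in the recorded timeline and the original URL, so that activating the link can either replay the recorded moment or reopen the live page; the field names and the browser-based reopen behavior below are illustrative assumptions.

```python
import webbrowser
from dataclasses import dataclass
from typing import Optional


@dataclass
class DeepLink:
    marker_id: int            # identifier encoded in the on-screen icon
    timestamp: float          # position in the recorded timeline (seconds since epoch)
    url: Optional[str] = None  # original page, if the marked screen was a website


registry: dict[int, DeepLink] = {}


def activate(marker_id: int, reopen: bool = False) -> DeepLink:
    """Jump to the recorded moment; optionally reopen the live page as well."""
    link = registry[marker_id]
    if reopen and link.url:
        webbrowser.open(link.url)   # reconnect to the live webpage
    return link                     # caller seeks the recorded timeline to link.timestamp


# Example: register a marker for a search-results screen, then activate it later.
registry[1234] = DeepLink(marker_id=1234, timestamp=1_694_800_000.0,
                          url="https://www.google.com/search?q=mechatronic+devices&start=20")
moment = activate(1234, reopen=False)
```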

In one or more embodiments, any and/or all of the structured and/or unstructured data recorded by the system can be made available, via an application programming interface, for third-party use. Such uses can include extending, extrapolating, and/or interpolating the data; analyzing the data; incorporating the data into other programs and/or applications; building new applications based on the data; and so on.

Furthermore, in one or more embodiments, the above-mentioned AI capabilities can be implemented via the machine learning component 112 and the reward component 120, in conjunction with genetic and/or other algorithms, to help train the model(s) 110. For example, the machine learning component 112 can continuously (and/or substantially continuously) run the model(s) 110 in the computing background while the user is interacting with the computing device. The model(s) 110 can then continuously (and/or substantially continuously) predict what actions/intentions the user will perform next. If a particular model 110 is correct regarding a particular prediction, that model 110 can be rewarded by the reward component 120. Genetic algorithms can then be used so as to generate new models that are based on the most successful (e.g., most rewarded) model(s) 110.

In various embodiments, such AI capabilities and/or algorithms can be used to create an identity score (e.g., a mathematical function that measures the differences between a particular model's 110 (and/or the entire system's) predicted actions/intentions and the user's actual actions/intentions). The cycle of continuously running models 110, rewarding successful models 110, and breeding, via genetic algorithms, the most successful models 110 can help to optimize the identity score (e.g., to minimize the differences between the invention's predicted outcomes and the user's actual behavior/intentions). In other words, the machine learning component 112 and the reward component 120 can, in various embodiments, create an AI version of the user that runs in a simulated computing environment (e.g., like a virtual version of the user's machine) and that has the opportunity to react to the same on-screen situations as the actual user. The AI methods and algorithms described above (e.g., continuously running and rewarding of models) can be used to make the AI version of the user more accurately replicate/simulate the actual user (e.g., thereby optimizing the identity score). Thus, the models 110 can be run, via the machine learning component 112 and the reward component 120, substantially continuously against the actual user (e.g., a human user) such that the system/models can learn from their mistakes (e.g., incorrect predictions), thereby ultimately generating models 110 that more accurately predict user behaviors, actions, and/or intentions. Those of skill in the art will appreciate that less frequent model training (e.g., less than continuously) can be incorporated.
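By way of non-limiting illustration, the identity score and the reward/selection cycle described above can be sketched as follows: the score is the fraction of predictions that disagree with the user's actual actions, and the genetic step keeps the best-scoring models and perturbs copies of them. The Model interface and the mutation scheme are illustrative assumptions.

```python
import random
from typing import Callable

# A model is represented here as a callable mapping an observed situation to a predicted action.
Model = Callable[[str], str]


def identity_score(model: Model, history: list[tuple[str, str]]) -> float:
    """Fraction of situations where the model's prediction disagrees with the user's
    actual action; lower is better (0.0 means a perfect replica of the user)."""
    misses = sum(1 for situation, actual in history if model(situation) != actual)
    return misses / max(len(history), 1)


def evolve(population: list[Model],
           history: list[tuple[str, str]],
           mutate: Callable[[Model], Model],
           keep: int = 2) -> list[Model]:
    """Reward (keep) the best-scoring models and breed mutated copies to refill the population."""
    ranked = sorted(population, key=lambda m: identity_score(m, history))
    survivors = ranked[:keep]
    children = [mutate(random.choice(survivors)) for _ in range(len(population) - keep)]
    return survivors + children
```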

In one or more embodiments, such AI models and/or algorithms can be used to simulate/generate different and/or alternative interactions that are distinct from what the actual user would do and that can be made available for review by the user. In other words, while the above section discusses how these AI models/algorithms can be used to “see and predict what the user would do,” these AI models/algorithms can also be implemented to “see and predict what the user's alter ego would do.” That is, rather than simply attempting to replicate how the actual user would behave in a particular computing situation, the AI models/algorithms can be implemented to simulate various, different, and/or alternative approaches/behaviors that can be taken in that computing situation. This could allow the user and/or the system to obtain new insights into the most and/or least efficient responses/actions to particular computing situations/scenarios. For example, the AI models/algorithms can be implemented to predict how the real user would actually go about putting an item up for sale on the internet (e.g., what actions the user would perform on his/her computing device(s) to sell an item on eBay). However, the AI models/algorithms can also be implemented to simulate different methods of selling the item on the internet, including methods/actions that the real user might not perform on his/her computing device(s). These simulations of alternative actions can then be reviewed and/or compared with the user's actual actions (and/or with the system's simulations of the user's actual actions) to determine which are better (e.g., more efficient, more secure, less time consuming, less resource intensive, and so on). To implement such models/algorithms, the reward functions and/or variables, as described above, could be adjusted to favor differently biased actions (e.g., rewarding actions that result in higher focus time, rewarding actions that result in more family time, rewarding actions that result in fewer cyber-security risks, rewarding actions that expend fewer computing resources, and so on).

In one or more embodiments, the avatar component 302 can utilize the above-mentioned AI techniques to help facilitate the system 100 to function as a personal artificial intelligence assistant (e.g., personal AI) to the user. Such a personal AI can, in various embodiments, act (e.g., via the action component 116) as a conduit between the user and external AI systems (e.g., representing external service providers and/or good providers, and so on) with which the user interacts. Such external AIs can include, for example, Amazon Alexa, Siri, online shopping and/or checkout wizards, and so on. For example, the user can instruct, via keyboard and/or voice commands, the system to order something from an online vendor and/or clearing house. In such case, the user can provide the pertinent/desired information to the personal AI (e.g., desired good/service, quantity desired, delivery/pick-up date desired, and so on) via keyboard and/or voice commands, and the personal AI, via the action component 116, can then interact with an external AI program of the vendor/clearing house so as to place the order. In some embodiments, the personal AI can place the order (and/or perform any other specified action) by using data from the Lifegraph and/or Personal Timeline (discussed below) to respond to the queries of the external AI. In such case, the Lifegraph and/or Personal Timeline data can be considered pre-collected briefing information usable by the personal AI, thereby enabling the personal AI to facilitate the user's commands and/or intentions without always having to actively query the user for information. For instance, the personal AI can fill in data fields requested by the external AI and/or answer any data queries from the external AI based on the inputted information and/or other stored information of the user (e.g., credit card number, date of birth, billing address, and so on), which can include, but is not limited to, Lifegraph and/or Personal Timeline data. Those of skill will appreciate that the personal AI can perform similar functions with any external programs, regardless of whether those external programs utilize artificial intelligence/machine learning and regardless of the good/service provided.
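By way of non-limiting illustration, the pre-collected briefing behavior described above can be sketched as answering an external service's data queries from the stored profile where possible and falling back to asking the user only for missing fields; the profile keys and the ask_user() helper are illustrative assumptions.

```python
def ask_user(field: str) -> str:
    """Hypothetical fallback: actively prompt the user for a field the profile lacks."""
    return input(f"Please provide your {field}: ")


def answer_external_queries(requested_fields: list[str],
                            profile: dict[str, str]) -> dict[str, str]:
    """Fill each field requested by the external AI from stored profile data when possible."""
    answers = {}
    for field in requested_fields:
        answers[field] = profile.get(field) or ask_user(field)
    return answers


# Example: an external checkout wizard requests three fields; two are already on file.
profile = {"billing_address": "123 Example St.", "date_of_birth": "1980-01-01"}
order_form = answer_external_queries(
    ["billing_address", "date_of_birth", "delivery_date"], profile)
```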

Relatedly, the system can, in one or more embodiments, not only perform new actions as instructed by the user, but can also repeat/redo recorded actions, either exactly and/or in the abstract. For example, as described above, the input component 106 can receive and/or record images, screenshots, and/or extrinsic data 107 regarding the user's interactions with the computing system. These recordings can, in some instances, contain images, screenshots, and/or videos of the user performing certain actions/tasks (e.g., ordering a good/service from an online vendor, making an appointment with a good/service provider, booking a flight, sending an email, and so on). In various embodiments, the images, screenshots, and/or videos of the performances of those actions can be replayed exactly, not just to view and/or study them, but also to actually redo, repeat, and/or reperform the recorded actions. For example, if images, screenshots, and/or videos of the user ordering a good (e.g., body soap) from the internet were received/recorded by the input component 106, the action/task captured in those images, screenshots, and/or videos can be repeated, at the keyboard and/or vocal command of the user, via the action component 116 such that another unit of that identical good (e.g., body soap) can be reordered by the system/personal AI. In various other embodiments, such recorded actions/tasks can be repeated in the abstract (e.g., forming generalized task patterns). For example, the system/personal AI can recognize that certain recorded images, screenshots, and/or videos show the user booking a particular flight with a particular airline. At the keyboard and/or vocal command of the user, the system/personal AI can, via the action component 116, redo/repeat that task pattern in the abstract (e.g., booking a flight for the user on a desired date, with a desired airline, and/or at a desired price, even though such desired date, airline, and/or price do not match the ones shown in the recorded images, screenshots, and/or videos). Those of ordinary skill in the art will appreciate that such exact and/or abstract redo/replay of recorded actions/tasks can be performed regardless of the type of recorded action/task or device. Moreover, those of skill will also appreciate that such exact and/or abstract redo/replay can be based on any type of data available to the system (e.g., video, audio, motion, location, bio-data, and so on).

Moreover, while these abstract patterns can be generated by studying the actions/patterns of a single user, they can also be generated by studying the actions/patterns of any number of users (e.g., thousands of users, and so on). Thus, the “best” (e.g., least expensive, most efficient, quickest, most secure, most reliable, and so on) way of performing an abstract pattern can be determined by analyzing and/or comparing how a plurality of users have previously performed the pattern. For example, consider the overall action of putting an item for sale on eBay. For many people, placing an item up for sale on eBay can be a multistep process (e.g., listing the item, including pictures and/or a written description of the item, printing shipping labels, scheduling shipping services, interacting with a third-party secured transaction processor (e.g., PayPal), and so on). The system/personal AI can create an abstract pattern by recording, analyzing, and/or learning how a single user performs these steps on his/her computing device(s). However, if the system/personal AI has access to the data of hundreds and/or thousands of users, the system/personal AI can generate one or more abstract patterns that cover a variety of categories (e.g., different patterns for selling different types of goods, different patterns for providing different types of photos/descriptions, different patterns for using/obtaining different types of shipping services, and so on), which patterns can then be compared and/or analyzed to find the optimal pattern(s). By analyzing such a multitude of users' patterns (e.g., thousands of patterns/examples of “putting things for sale on eBay”), the system/personal AI can reliably determine/learn what types and/or pieces of information/data are required, which data fields need to be filled out, which shipping services are the most reliable, which patterns are the most profitable, and so on.

In various embodiments, multiple task patterns (e.g., abstract actions, as described above) can be chained together to build larger task patterns. In such case, the larger task pattern can correspond to a complicated and/or overall goal/action to be performed, and the smaller task patterns can correspond to the constituent actions/steps to be performed in order to achieve the overall action. For example, if the user instructs the system/personal AI to make an online appointment with a drywall installer, the system/personal AI can perform that overall goal/action by performing a sequence/chain of smaller tasks. These smaller tasks/task patterns can include searching for nearby drywall installers, vetting and/or checking credentials of located drywall installers, choosing a drywall installer (e.g., based on price, online reviews, and so on), interacting with the website of the chosen drywall installer so as to book an appointment, inputting payment and/or address information, and so on. In some embodiments, the system/personal AI can have previously recorded the user (and/or any number of other users) performing such smaller tasks, and so the system/personal AI can recognize the abstract task patterns underlying each constituent step, as described above. By chaining these general task patterns together (e.g., performing them in sequence), the overall, more complicated goal/action can be achieved. In some cases, the system/personal AI can measure the performance efficiency/efficacy of the chained task patterns, so as to compare with different potential pattern chains for accomplishing the same overall goal/action.
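By way of non-limiting illustration, chaining parameterized task patterns into a larger pattern can be sketched as executing smaller patterns in sequence over a shared context while accumulating a cost measure for comparing alternative chains; the step names and cost bookkeeping below are illustrative assumptions.

```python
from dataclasses import dataclass
from typing import Any, Callable


@dataclass
class TaskPattern:
    name: str
    run: Callable[[dict[str, Any]], dict[str, Any]]  # consumes and extends a shared context
    cost: float = 0.0                                # e.g., measured seconds or steps


def run_chain(patterns: list[TaskPattern],
              context: dict[str, Any]) -> tuple[dict[str, Any], float]:
    """Execute smaller task patterns in sequence to achieve the overall goal,
    accumulating cost so alternative chains can be compared."""
    total_cost = 0.0
    for pattern in patterns:
        context = pattern.run(context)
        total_cost += pattern.cost
    return context, total_cost


# Illustrative chain for "book an appointment with a drywall installer".
chain = [
    TaskPattern("search_installers", lambda ctx: {**ctx, "candidates": ["A", "B"]}, cost=2.0),
    TaskPattern("choose_installer", lambda ctx: {**ctx, "chosen": ctx["candidates"][0]}, cost=1.0),
    TaskPattern("book_appointment", lambda ctx: {**ctx, "booked": True}, cost=3.0),
]
result, cost = run_chain(chain, {"goal": "drywall appointment"})
```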

In one or more embodiments, the system (e.g., via the avatar component 302 and/or the personal AI) can create a Lifegraph that can represent all and/or a subset of all the people, objects, and/or relationships that are detectable (e.g., via image and/or optical recognition analysis, any other type of data analysis, and so on) in the images, screenshots, and/or other extrinsic data 107 that are received and/or recorded by the input component 106. As mentioned above, this can further include any and/or all data received by the input component 106, including visual data, non-visual data, bio-data, audio data, geolocation data, motion data, weather data, temperature data, and so on. A Lifegraph (e.g., a Model of the World) can be an n-dimensional (e.g., where n is a positive integer) graph that can represent and/or depict the web of people/objects/relationships connected to the user and/or the system. In some embodiments, the Lifegraph can depict and/or record how these people/objects/relationships change through/over time. For example, if a user sent an email to person A, held a telecommunication session (e.g., via Skype) with person B, and instant messaged and/or posted a message to person C (e.g., via Facebook), the system and/or personal AI can recognize that all three of person A, person B, and person C are somehow connected to the user of the system, and so those three persons can be included and/or represented in the Lifegraph of the user. Furthermore, the system and/or personal AI can, in various embodiments, recognize the context in which such persons appear in the recorded data 107 to further define the relationships between the user and those persons. For instance, if the email to person A and the telecommunication session with person B both pertained to a particular academic/work project/assignment, the Lifegraph can show that person A, person B, and the user are all connected in some way via that particular academic/work project/assignment. Similarly, if the instant message to person C did not pertain to that project/assignment, but instead was related to a dinner date, the Lifegraph can show that person C and the user are connected via that dinner date and not via that project/assignment. If more specific information about person A, person B, and person C is discernable from the recorded data 107 and/or otherwise available to the system/personal AI, such information can further be included in the Lifegraph (e.g., names, ages, occupations, and/or familial relations of persons A, B, and C, and so on).
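By way of non-limiting illustration, the person A/person B/person C example above can be sketched as a graph using the networkx library, with typed, time-stamped relationship edges between the user and detected entities; the edge attributes shown are illustrative assumptions.

```python
import networkx as nx

lifegraph = nx.MultiDiGraph()  # multiple typed edges allowed between the same two nodes


def add_interaction(graph: nx.MultiDiGraph, source: str, target: str,
                    channel: str, context: str, timestamp: float) -> None:
    """Record one detected interaction as a typed, time-stamped edge in the Lifegraph."""
    graph.add_edge(source, target, channel=channel, context=context, timestamp=timestamp)


add_interaction(lifegraph, "user", "person_A", channel="email", context="work project", timestamp=1.0)
add_interaction(lifegraph, "user", "person_B", channel="skype", context="work project", timestamp=2.0)
add_interaction(lifegraph, "user", "person_C", channel="facebook", context="dinner date", timestamp=3.0)

# Query: who is connected to the user via the work project?
project_contacts = {target for _, target, data in lifegraph.out_edges("user", data=True)
                    if data["context"] == "work project"}
print(project_contacts)  # -> {'person_A', 'person_B'}
```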

Furthermore, data points, nodes, and/or entries in the Lifegraph can, in some embodiments, be generated by other data streams. For instance, a mobile application can provide location information regarding the user and/or persons A, B, and/or C. So, if a mobile device's location sensor (e.g., GPS sensor, and so on) indicates that the user was at a particular latitude/longitude that corresponds to a gym, the system/personal AI can determine that the recorded latitude/longitude corresponds to the gym (e.g., via an internet search) and can thus include the gym on the user's Lifegraph. Similarly, because the Lifegraph can track people, objects, and/or relationships through time, temporal correlation can be used to further interrelate nodes, entries, and/or data in the Lifegraph. For example, if the user meets with persons A and B in a latitude/longitude that corresponds to a school and such meeting occurs at and/or around the same time that the user and persons A and B send documents to each other pertaining to a particular project, the system/personal AI can note in the Lifegraph that the school location is relevant to the project, and so on.

Moreover, the Lifegraph generated by the system/personal AI can, in one or more embodiments, be queried by the user, the personal AI, and/or a third party (e.g., with sufficient authorization). For example, the user can ask the system and/or personal AI to send a Christmas card/message to the user's “close family contacts.” In such case, the personal AI can query the Lifegraph to determine who from the list of people with whom the user interacted can be considered a close family contact of the user. As explained above, a list of close family contacts can be included in the Lifegraph by examining and/or analyzing the recorded data 107 (e.g., emails, calls, live messaging, extrinsic data, and so on). Continuing the example from above, person C can, in some cases, be considered a close contact since the subject of a recorded communication between person C and the user pertained to a dinner date (e.g., which seems personal and not work- and/or school-related). Persons A and B, on the other hand, can be considered not close family contacts since the subjects of the recorded communications between them and the user pertained to work/school assignments only, thereby not indicating a close and/or personal relationship. However, if other images, screenshots, and/or data 107 received/recorded by the input component 106 contained communications between the user and persons A and B in a more personal context (e.g., emails between the user and persons A and B discussing whether they would attend a certain family reunion, and so on), the Lifegraph can indicate that persons A and B are, indeed, close familial contacts of the user.

In one or more embodiments, the Lifegraph can include user preferences that can be deduced from the recorded people, objects, and/or relationships and/or their changes over time. The incorporation of screen data and/or non-screen data (e.g., bio-data, location data, motion data, audio data, time data, weather data, and so on) can be used to facilitate deduction of such preferences. For instance, the Lifegraph can record that the user attends a gym (e.g., via geolocation data from a mobile device, and so on) periodically (e.g., with a regular and/or somewhat regular time interval between gym visits). If the time interval between gym visits is relatively short (e.g., a few days), the Lifegraph can note that the user prefers to go to the gym and/or exercise regularly. If, on the other hand, the time interval between gym visits is relatively long (e.g., a few weeks and/or months), the Lifegraph can note that the user prefers to not go to the gym regularly. Any other types of user preferences can be deduced and/or stored in the Lifegraph. For example, if the user often buys an aisle and/or window seat on flights, the Lifegraph can deduce that the user prefers the aisle and/or window seat. Moreover, such preferences (and/or any other information in the Lifegraph) can be available for querying by the personal AI. For instance, if the user researches and purchases a particular car (e.g., purchased online or discussed the purchase via email, and so on), the information about the car uncovered during the online research and/or via the emails (e.g., chassis number, manufacturer, model, price, and so on) can be stored in the Lifegraph and can be available for querying. So, the user can command the personal AI to research insurance options for their vehicle, and the personal AI can access such information to research available insurance options without having to manually receive the input (e.g., chassis, model, price, and so on) from the user. In some embodiments, the personal AI can begin obtaining the insurance (e.g., filling out online information/data fields), such that only the user's final authorization and/or signature is needed to finalize his/her purchase of automobile insurance.
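By way of non-limiting illustration, the gym-visit example above can be sketched by comparing the median gap between time-stamped visits against a cutoff to note a regular or irregular preference; the cutoff value is an illustrative assumption.

```python
from datetime import datetime
from statistics import median


def deduce_visit_preference(visit_times: list[datetime],
                            regular_cutoff_days: float = 7.0) -> str:
    """Label a habit as regular if the median gap between visits is short."""
    if len(visit_times) < 2:
        return "insufficient data"
    ordered = sorted(visit_times)
    gaps_days = [(b - a).total_seconds() / 86_400 for a, b in zip(ordered, ordered[1:])]
    return "prefers regular visits" if median(gaps_days) <= regular_cutoff_days else "visits irregularly"


visits = [datetime(2018, 3, day) for day in (1, 4, 6, 9, 12)]
print(deduce_visit_preference(visits))  # -> prefers regular visits
```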

In various other embodiments, the user can navigate through the Lifegraph in time; that is, the user can see how the Lifegraph evolved and/or changed over time. This temporal data can then be used in various analyses to answer various questions (e.g., How many new relationships has the user created in the time interval? How many projects has the user started and completed during the time interval? How have the user's relationship intensities changed over time? How have the user's interests and/or preferences shifted over time? and so on).

In various embodiments, the system and/or personal AI can create a Personal Timeline/Timeseries of the user and/or the system. For example, all the recorded images, screenshots, videos, and/or other extrinsic data 107 (which can, again, include non-visual/non-screen data such as motion data, audio data, location data, bio-data, and so on) can be multiplexed together into a single structure, which structure can be represented as a file, a video, a database, and so on. The Personal Timeline can represent the computing history (e.g., both direct compute data streams and/or non-direct-compute data streams, and so on) of the user (e.g., all the interactions that the user has had and/or performed on the computing device and/or other computing devices). Such a Personal Timeline can represent a central, comprehensive history of the user's computing interactions (e.g., containing all the user's on-screen interactions from their personal computer, their mobile device(s), their desktop computers, their gaming consoles, and so on; and including all their non-on-screen interactions, such as geolocation data from mobile devices, audio and/or motion data from various other devices, bio-data from bio sensors, information/data from smart appliances, smart cars, smart houses, and so on), and can thus be queried accordingly. That is, all and/or any data regarding the user from all and/or any of the user's smart devices can be multiplexed into a single Personal Timeline/Timeseries, thereby correlating all of such data in time and allowing for easy/efficient navigation through such data by the user and/or the personal AI. Those of skill in the art will understand that the Personal Timeline can represent shorter and/or limited spans of time, in some embodiments. In some cases, the order in which the data (e.g., again, any data from any smart device can be multiplexed into the Personal Timeline, including motion, audio, video, bio, location, temperature, weather, time, and so on) are stored in the Personal Timeline can represent the order in which such data actually occurred and/or was collected chronologically.
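By way of non-limiting illustration, multiplexing several time-stamped device streams into one chronologically ordered Personal Timeline can be sketched with heapq.merge; the stream contents below are illustrative placeholders.

```python
import heapq

# Each stream is a list of (timestamp, source, payload) tuples, already in time order.
screen_stream = [(10.0, "screen", "word processor visible"), (40.0, "screen", "email open")]
location_stream = [(15.0, "gps", "latitude/longitude near gym")]
audio_stream = [(12.0, "microphone", "voice command detected")]

# Merge all streams into a single chronological timeline keyed on the timestamp.
personal_timeline = list(heapq.merge(screen_stream, location_stream, audio_stream,
                                     key=lambda event: event[0]))
for timestamp, source, payload in personal_timeline:
    print(f"{timestamp:6.1f}  {source:10s}  {payload}")
```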

In one or more embodiments, various points on the Personal Timeline can be tagged and/or marked for future reference. Additionally, the user can, in some cases, add annotations and/or comments to a particular marked/tagged point on the Personal Timeline. Furthermore, documents and/or files (e.g., downloaded files, edited/created documents, and so on) can be attached to various points in the Personal Timeline, such as to the point at which a particular document was created and/or downloaded. This would allow the user to view and/or access that document and/or file when navigating through the Personal Timeline. It could also visually and/or otherwise correlate that document/file with some, any, and/or all other data in the Personal Timeline that are associated with that point in time. Moreover, such tags, marks, annotations, and/or files/documents can be queried and/or accessed from the Personal Timeline by an application programming interface, and so on.

For instance, the Personal Timeline can, in some cases, be a video file (or a set of images taken substantially periodically over time) along with any other metadata (e.g., audio data, time data, location data, and so on) documenting all the user's actions on the computing device(s) for the past several months. If the user has been working on a long-term assignment/project, the user can query and/or navigate the Personal Timeline so as to see images, video clips, and/or any other data pertaining to the user's work on the project over the past several months. If the user found a particular image and/or video snippet that was of particular importance (e.g., an image of a special website source that was consulted for the project, and so on), the user can mark/tag that image/video segment in the Personal Timeline and can add an appropriate annotation for later reference. Again, the Personal Timeline can include any type of data collected by the input component 106, including visual/on-screen data and/or non-on-screen data (e.g., audio, motion, location, weather, temperature, biological, temporal, and so on).

Furthermore, the Personal Timeline can, in various embodiments, correspond to (and/or be used to create) a stored visual history of all visited and/or viewed webpages and/or programs that were displayed on screen (and any other data collected by the system, including non-on-screen data, as described above). In some embodiments, the full visual history can be available through Spotlight and/or a Microsoft analog for navigation/querying by the user.

In some cases, the visual history can include the percentage of a displayed webpage that was actually displayed on screen (e.g., if a user opens a scrollable webpage and scrolls through only 75% of it, the Personal Timeline/visual history can note that only 75% of the webpage was viewed at that point in time). In some embodiments, the amount of a particular webpage that was unscrolled/unviewed can be displayed/noted.

In still other embodiments, the Personal Timeline/visual history can keep track of all windows that were opened by the user, whether or not those windows were actually displayed on screen (e.g., hidden by other windows, minimized, and so on).

In some embodiments, a plurality of users (e.g., thousands and/or millions of users) can use the disclosed system. In such case, a central, global storage pool of website data can be created, which can include any and/or all website pages and/or screens seen (and/or opened but hidden) by any and/or all of the users. Thus, the global storage pool can be considered a substantially complete record of all historical website data viewed by any of the users, thereby allowing any recorded website to be requested/summoned, including website pages that are no longer functional and/or available on the internet.

In one or more embodiments, the system/personal AI can perform automated clustering of timeline segments from the Personal Timeline/visual history. As explained above, the Personal Timeline/visual history can represent the entire on-screen history of the computing device (e.g., every screen image displayed by the computing device and/or other devices for a defined and/or indefinite time period) as well as any other data from any and/or all of the user's other devices (e.g., audio data, motion data, location data, weather data, bio-sensor data, time data, temperature data, and so on). However, in some cases, not every data cluster is substantively related to the other data clusters in the Personal Timeline. In such case, the system/personal AI can divide the Personal Timeline and/or visual history into separate segments/clusters that correspond to a particular substantive task/action performed by the user. This can improve the searching and/or querying efficiency pertaining to navigating through the Personal Timeline/visual history. This can also help to teach the system/personal AI to facilitate the detection/classification of user intentions and/or actions. Moreover, this can help to teach the system/personal AI to determine which tasks/assignments the user has worked on without being distracted and which tasks/assignments the user worked on with significant distractions and/or breaks/lapses in focus.

For example, consider a Personal Timeline that contains thirty stored minutes of screen image and/or video data (and/or any other data, as described above). The first ten minutes of the Personal Timeline can contain screen images/videos of the user working on a school paper/assignment. The next five minutes can contain screen images/videos of the user checking an email account. The next three minutes can contain screen images/videos of the user browsing on the internet. Finally, the last twelve minutes can contain screen images/videos of the user continuing to work on the school paper/assignment. The system/personal AI can recognize that the first ten minutes can constitute a first time segment/cluster pertaining to the substantive task/action of writing a school paper. This cluster/segment can be accordingly marked, tagged, annotated, and/or otherwise distinguished from the rest of the data in the Personal Timeline/visual history (e.g., automatically marked and annotated by the system/personal AI, and so on). The system/personal AI can also recognize that the next five-minute and three-minute intervals respectively pertain to the separate substantive tasks/actions of checking emails and browsing the internet, and do not pertain to the substantive task/action of writing the paper. So, a second time segment and a third time segment can be accordingly marked/tagged/annotated in the Personal Timeline by the system/personal AI. Lastly, the system/personal AI can recognize that the final twelve minutes pertain to the substantive task/action of writing the paper. Thus, those twelve minutes can be marked/tagged/annotated as being part of the same time segment/cluster as the first time segment (e.g., pertaining to the paper/assignment). Now, if the internet browsing involved searching for sources for the paper/assignment, the system/personal AI can recognize this (e.g., via image and/or word analysis), and thereby include that three-minute interval in the first time segment/cluster (e.g., pertaining to the paper).
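
By way of a non-limiting illustration, the following minimal sketch (in Python) groups a toy thirty-minute timeline into contiguous segments and then associates segments pertaining to the same substantive task, mirroring the example above. The per-minute task labels are hypothetical stand-ins for whatever classifications the system's image/OCR analysis would actually produce.

```python
from itertools import groupby

# One hypothetical task label per minute, standing in for the system's
# image/OCR-derived classifications (assumed to already exist).
minutes = (["paper"] * 10) + (["email"] * 5) + (["browsing"] * 3) + (["paper"] * 12)

# Step 1: split the timeline into contiguous segments of identical labels.
segments = []
start = 0
for label, run in groupby(minutes):
    length = len(list(run))
    segments.append({"task": label, "start": start, "end": start + length})
    start += length

# Step 2: cluster segments that pertain to the same substantive task,
# even when they are separated in time (e.g., both "paper" segments).
clusters = {}
for seg in segments:
    clusters.setdefault(seg["task"], []).append((seg["start"], seg["end"]))

print(segments)
print(clusters)  # {'paper': [(0, 10), (18, 30)], 'email': [(10, 15)], 'browsing': [(15, 18)]}
```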

Those of skill in the art will appreciate that various embodiments of the present invention are not limited to segmenting the Personal Timeline/visual history into minute-long segments. Instead, any time interval can be used (e.g., years, months, weeks, days, hours, minutes, seconds, sub-seconds, and so on). Indeed, the Personal Timeline/visual history can be segmented into arbitrarily small segments of time and is not constrained to clustering/segmenting based on periodic, regular, and/or recurring time intervals. Additionally, those of skill in the art will appreciate that the clustering/segmenting discussed above can be implemented with any type of data stored in the Personal Timeline/visual history (e.g., visual/on-screen data, audio data, motion data, bio-sensor data, location data, weather data, temperature data, and so on).

In various embodiments, the system/personal AI can perform such clustering/segmenting by using image analysis, focus object/section analysis, wordspace analysis, audio analysis, bio-data analysis, and/or any other data analysis and/or detection technique. In various embodiments, the system/personal AI can determine that all screen images which display at least part of a particular image and/or focus object/section are related to the same substantive task/action. To continue the above example, the system/personal AI can, in some cases, recognize that the user was working on the paper/assignment during the first ten minutes and during the last twelve minutes because the word document in which the user was writing the paper/assignment was displayed on the screen of the computing device during those time periods. Similarly, the system/personal AI can determine that the five-minute and three-minute intervals did not pertain to the substantive task/action of writing the paper because the word document in which the user was writing the paper/assignment was not displayed on the screen (e.g., minimized and/or hidden behind the email webpage and/or internet webpage, and so on). Likewise, the system/personal AI can determine that only the five-minute time period pertained to checking emails since the email account webpage was displayed on the screen only during those five minutes, and so on. Now, if the paper/assignment pertained to a certain topic (e.g., American colonial history), the system/personal AI can use wordspace analysis and/or object detection to further interrelate time segments. For example, the system/personal AI can recognize key terms (e.g., via OCR, and so on) in the word document, and can determine that such key words (e.g., “American history,” “colonial,” “George Washington,” and so on) appeared on the screen during the three-minute time interval in which the user browsed the internet. Thus, the system/personal AI can mark, tag, and/or annotate that the three-minute interval is indeed related to the paper/assignment, notwithstanding that the focus object (e.g., the word document) was hidden and/or otherwise not displayed on the screen during that time period. Moreover, a combination of wordspace and object detection can be used to facilitate such features.
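
As a further non-limiting illustration, the sketch below shows one simple way a "wordspace" relatedness check could be expressed, assuming OCR text is already available for each time segment. The key terms and OCR snippets are hypothetical, and the overlap score is only a stand-in for whatever analysis the system actually employs.

```python
# Minimal sketch of wordspace relatedness between a focus document and a
# time segment, assuming OCR'd on-screen text is already available.
PAPER_KEY_TERMS = {"american", "history", "colonial", "george", "washington"}

def wordspace_overlap(ocr_text: str, key_terms: set) -> float:
    """Fraction of the key terms that appear in the OCR'd on-screen text."""
    words = set(ocr_text.lower().split())
    return len(words & key_terms) / len(key_terms)

browsing_ocr = "Results for George Washington and colonial American history sources"
email_ocr = "Inbox 3 new messages weekend plans lunch on friday"

# A segment whose on-screen words substantially overlap the paper's key terms
# can be re-assigned to the paper cluster, as in the example above.
print(wordspace_overlap(browsing_ocr, PAPER_KEY_TERMS))  # high overlap -> related
print(wordspace_overlap(email_ocr, PAPER_KEY_TERMS))     # low overlap -> unrelated
```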

In various embodiments, the Personal Timeline, the visual history, and/or the monitoring of nonvisible/hidden windows can be simultaneously leveraged by the system/personal AI in order to build a background model of relatedness that can present the user with specialized work rooms depending on the substantive task/action on which the user is currently working. For example, if the user is preparing a word document relating to a school project (e.g., an engineering lab report), the system/personal AI can detect the word document (e.g., via image/OCR analysis) and the topic of the paper (e.g., via wordspace analysis). Moreover, the system/personal AI can automatically minimize, cancel, and/or otherwise hide any other windows, programs, and/or notifications that are not substantively related to the task at hand (e.g., writing the engineering lab report). Thus, an open Facebook webpage can be hidden and/or otherwise not displayed on the screen, while an email account displaying a conversation with a classmate of the user about the engineering lab report (e.g., again, recognizable via wordspace analysis) can remain on the screen, be brought to the front of the screen if otherwise hidden, and/or be resized so as to share the screen with the word document. Additionally, an internet webpage that pertains to the topic of the engineering lab report (again, recognizable via image and/or wordspace analysis) can also be brought to the front of the screen along with the email conversation and the word document, thereby allowing the user to quickly view all windows and/or programs that are pertinent to the lab report without having to sift through multiple hidden windows, thereby avoiding distraction. In this way, the open windows/programs that are most relevant to the user's current task/action can be automatically presented on the display screen while those that are least relevant can be blocked, suppressed, and/or hidden. This can also occur based on other, non-visual information to preserve a moment of flow (e.g., a moment when the user is most concentrated and/or in a creative sweet spot). For instance, if the system/personal AI detects that the user is focused and/or concentrating (e.g., via bio-data and/or EEG readings, and so on), windows, programs, and/or notifications that are relevant to the task on which the user is focused can be displayed on screen while those that are irrelevant and/or potentially distracting can be suppressed and/or hidden.
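
The following minimal sketch illustrates one way such task-aware window arrangement could be scored, assuming each open window exposes a title/OCR snippet. The window titles, snippets, and task terms are illustrative assumptions only; the real system would derive them from the image/wordspace analysis described above.

```python
# Minimal sketch of scoring open windows for relevance to the current task.
TASK_TERMS = {"engineering", "lab", "report"}

open_windows = {
    "Word - lab_report.docx": "engineering lab report draft results section",
    "Email - classmate":      "re: engineering lab report figures question",
    "Facebook":               "news feed friend requests notifications",
    "Browser - tutorial":     "oscilloscope measurement engineering lab tutorial",
}

def relevance(snippet: str) -> int:
    # Count how many task terms appear in the window's title/OCR snippet.
    return len(set(snippet.lower().split()) & TASK_TERMS)

to_show = [title for title, text in open_windows.items() if relevance(text) > 0]
to_hide = [title for title, text in open_windows.items() if relevance(text) == 0]

print("bring to front:", to_show)  # task-related windows
print("suppress/hide:", to_hide)   # e.g., the Facebook window
```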

Furthermore, in various embodiments, the system/personal AI can keep potentially relevant windows, programs, and/or notifications (e.g., that are not necessarily pertinent at the moment) prepared and/or at hand, such that the user can quickly/easily access them if needed without becoming significantly distracted or losing his/her train of thought. To continue the above example, the engineering lab report may require the inclusion of several graphs. If the user is not yet prepared to include the graphs but has nonetheless opened/accessed the graph files, the system/personal AI can minimize the graphs, place them in a corner of the screen, and/or keep them behind a quick interaction button/icon, allowing the user to quickly and easily access the graphs when needed. This can significantly improve the flow of work by reducing the time spent by the user on simply searching for the desired files (e.g., sifting through a plurality of open and/or hidden windows).

In various embodiments, the Personal Timeline, visual history, and/or other stored images, screenshots, and/or extrinsic data 107 (e.g., any type of data recorded by any type of device, as described above) can contain/depict sensitive and/or private information (e.g., credit card numbers, social security numbers, home addresses, passwords, and so on). The system/personal AI can ignore and/or otherwise camouflage (e.g., via blurring) such sensitive information to ensure that such information is not accidentally revealed if someone other than the user queries and/or navigates the Personal Timeline and/or visual history.

Such sensitive information can be recognized via the above-mentioned image analysis. In various embodiments, such user specific data can be otherwise recognized by identifying what information particular data fields are requesting from the user. Additionally, if there are multiple users of the system, the system/personal AI can compare the on-screen images (and/or any other data, as mentioned above) of the various users to determine which information is user specific and which is not. For instance, if a plurality of users inputted information into an airline website to purchase tickets, the system/personal AI could compare the various airline webpage screens from the various users to determine that the differences between the screens (e.g., user names, addresses, payment information, and so on) are user-specific data while the unchanging portions between the screens (e.g., airline instructions, and so on) are not user-specific. Moreover, in such case, different users can be shown (and alerted to the fact that they are shown) variations of data shown to other users. For instance, different users may look at the same airline ticket website and yet be shown different frequent flier points and/or bonus miles (because such points and/or bonus miles can depend on the user and/or whether the user is viewing a particular offer/advertisement, and so on). Thus, the system/personal AI can inform a user that he/she is viewing a better and/or worse deal than other users, and so on.

Once identified, the user specific information can be automatically blurred and/or pixelated. As another example, private communications in an email program or an instant messaging application can be blurred, and so on. Similarly, personal information on any and/or all screen sharing sessions, transmitted video recordings of the user's screen(s), and/or any other communication/transmission from the user's device(s) can be censored and/or blurred in real-time by the system/personal AI to ensure that such personal information is not unintentionally shared with unauthorized third-parties.
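
By way of a non-limiting sketch, sensitive strings detected in OCR'd screen text can be flagged so that the corresponding screen regions are blurred/pixelated. The regular-expression patterns below are illustrative assumptions only; a deployed system would presumably use more robust detectors (and, as described above, cross-user comparison).

```python
import re

# Illustrative detectors for sensitive strings in OCR'd screen text.
SENSITIVE_PATTERNS = {
    "credit_card": re.compile(r"\b(?:\d[ -]?){13,16}\b"),
    "ssn":         re.compile(r"\b\d{3}-\d{2}-\d{4}\b"),
}

def redact(ocr_text: str) -> str:
    # Replace each sensitive match with a placeholder; the matched screen
    # region would be blurred/pixelated in the stored frame.
    for label, pattern in SENSITIVE_PATTERNS.items():
        ocr_text = pattern.sub(f"[{label} blurred]", ocr_text)
    return ocr_text

frame_text = "Card: 4111 1111 1111 1111  SSN: 123-45-6789  Flight AA 102 to SFO"
print(redact(frame_text))
```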

Similarly, in various embodiments, the system/personal AI can be configured to use private/sensitive user information related to a particular user account (e.g., credit card number, social security number, account password, and so on) without being able to read and/or otherwise reveal such information to third parties without authorization. For example, if a user orders a good/service from an online vendor, the personal AI can interact with the vendor's website, provide the user's payment information, and thereby order the good/service. However, the personal AI can be prevented from otherwise sending and/or revealing such private information to other third parties (e.g., third parties who are not bona fide online vendors) without first receiving some type of affirmative authorization from the user. In some embodiments, the user can provide advance authorization for the personal AI to share such information.

In one or more embodiments, the Personal Timeline/visual history can be used to detect the creation of loose-end commitments (e.g., otherwise known as “open-loops”) and/or to get confirmation on whether such loose-end commitments have been resolved. That is, the system can detect (e.g., via image analysis, OCR, and/or word analysis by the machine learning component 112 and/or the analysis component 114) an open-ended question, request, invitation, query, and so on generated by the user and/or received by the user but that has not yet been resolved. For example, if the user receives a message (e.g., email) from person X asking “Do you want to meet at 3:00 pm?,” the system/personal AI can note when, where, and how the message was received (e.g., via tags/marks/annotations in the Personal Timeline and/or visual history) and can mark the question/email as an open-loop until the user responds to it. In some embodiments, the system/personal AI can ask the user at a later time whether or not the user answered/addressed the question/open-loop. As another example, if the user sends a message to person Y asking “How's the project?,” the system/personal AI can keep track of when, where, and how that message was sent and can mark the message/question as an open-loop until person Y responds. In some cases, the system/personal AI can notify the user at a later time if a response has been received. If a response has not yet been received, the personal AI can, in some cases, send a notification to that effect to the user. In various embodiments, an inventory of these loose-ends/open-loops can be created, which inventory can be navigated/queried by the user.
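
The following minimal sketch shows one way such open-loop bookkeeping could be kept. Detecting a question by a trailing "?" is a naive stand-in for the image/OCR/word analysis described above, and the message participants are hypothetical.

```python
from dataclasses import dataclass
from datetime import datetime

@dataclass
class OpenLoop:
    counterpart: str
    text: str
    opened_at: datetime
    resolved: bool = False

loops = []

def observe_message(sender: str, recipient: str, text: str, user: str = "me") -> None:
    if text.strip().endswith("?"):
        # A question opens a loop with whichever party is not the user.
        loops.append(OpenLoop(counterpart=sender if sender != user else recipient,
                              text=text, opened_at=datetime.now()))
    else:
        # Any later message involving the same counterpart closes that loop.
        for loop in loops:
            if not loop.resolved and loop.counterpart in (sender, recipient):
                loop.resolved = True

observe_message("person_x", "me", "Do you want to meet at 3:00 pm?")
observe_message("me", "person_y", "How's the project?")
observe_message("me", "person_x", "Yes, 3:00 works.")
print([(loop.counterpart, loop.resolved) for loop in loops])  # person_x resolved, person_y still open
```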

In one or more embodiments, the system and/or personal AI can use bio-data (e.g., EEG data, heartrate data, body temperature data, motion data, micromovement data, breathing data, and so on) described above (which can also be included in the Personal Timeline/Timeseries) to help monitor the emotions, sentiments, thoughts, and/or intentions of the user. More specifically, the user can, in various embodiments, operate the system/computing device(s) while wearing a suitable EEG device, heartrate monitor, thermometer, motion-capture device, and so on. In such case, one or more of the user's brainwaves, facial expressions, and/or other bodily responses can be recorded as the images, screenshots, and/or other extrinsic data 107 (again, which can include any data from any type of device) are simultaneously received/recorded by the input component 106. So, the bio-data can be correlated in time with the recorded images, screenshots, and so on (e.g., multiplexed and/or attached to the Personal Timeline/visual history, and so on). These correlations can teach the system/personal AI how the user's brain and/or body characteristically respond to certain on-screen and/or off-screen stimuli (e.g., notifications, pop-ups, received emails, sounds, locations, movements, and so on). Moreover, such bio-data can then be used to help predict the user's intention and/or behavior (e.g., by recognizing previously recorded brainwave patterns, bodily responses, and so on). As an example, the system/personal AI can record brainwaves and/or bodily responses of the user, which appear similar to brainwaves and/or bodily responses previously recorded when the user was in a mental state in which the user was highly productive and not distracted. In such case, the system/personal AI can recognize that the user has entered a creative sweet spot, and so the system/personal AI can temporarily suppress notifications, pop-ups, and/or other on-screen and/or off-screen distractions that could potentially bother the user and/or otherwise knock the user out of the mental sweet spot.

In various embodiments, the system/personal AI can leverage the data (e.g., bio-data and/or any other type of data receivable by the input component 106) to detect pattern precursors to distractions. For example, the system/personal AI can detect certain on-screen and/or off-screen patterns that suggest that the user is distracted (e.g., aimless clicking/scrolling, jiggling the cursor, and so on). These on-screen and/or off-screen patterns can, in some cases, be verified with the bio-data (e.g., as the user performs such distracted actions/tendencies, the user's brainwaves and/or bodily responses display patterns that the user previously displayed when distracted, such as fidgeting/restless legs, sighing, daydreaming, and so on). By detecting these distraction precursors, the system/personal AI can take remedial action if possible (e.g., suppressing notifications, closing unnecessary/distracting windows/programs, notifying the user that distraction and/or lack of concentration are imminent/occurring, and so on). Moreover, the system/personal AI can use the collected bio-data (and/or any other data) to create a productivity score for the user (e.g., a score indicating how productive/unproductive the user currently is based on their brainwaves, bodily responses, and/or on-screen actions).
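
A minimal sketch of such a productivity score is shown below. The signal names, weights, and squashing into [0, 1] are illustrative assumptions rather than values taken from the disclosure; the point is only that on-screen and bio-data signals can be combined into a single score.

```python
# Minimal sketch of a productivity score combining on-screen and bio-data signals.
def productivity_score(signals: dict) -> float:
    weights = {
        "keystrokes_per_min":       +0.4,  # sustained typing in the focus document
        "context_switches_per_min": -0.3,  # frequent window switching
        "cursor_jiggle_events":     -0.2,  # aimless clicking/scrolling precursor
        "focus_band_eeg":           +0.5,  # bio-data pattern seen when focused
    }
    raw = sum(weights[name] * value for name, value in signals.items() if name in weights)
    return max(0.0, min(1.0, 0.5 + raw / 10.0))  # squash into [0, 1]

focused    = {"keystrokes_per_min": 6.0, "context_switches_per_min": 0.5,
              "cursor_jiggle_events": 0.0, "focus_band_eeg": 4.0}
distracted = {"keystrokes_per_min": 0.5, "context_switches_per_min": 6.0,
              "cursor_jiggle_events": 5.0, "focus_band_eeg": 0.5}

print(productivity_score(focused))     # closer to 1.0
print(productivity_score(distracted))  # closer to 0.0
```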

Similarly, the system can recognize various triggers from the EEG data (and/or any other bio-data). In some cases, the system can passively learn (e.g., as explained above) how the user's brain and/or body respond to various stimuli/triggers. In some cases, the system can facilitate active training to recognize how the user's brain and/or body respond to various stimuli/triggers (e.g., showing the user images and/or other sensory input and then recording the user's brainwaves and/or other bodily responses, like micromovements, heart rate, breathing rate, temperature, and so on). Such measured brainwave and/or bodily response patterns can represent a user's intention to perform a particular task and/or action. Specifically, the system can learn which brainwave and/or bodily response patterns of the user correlate/correspond to which intended actions (as described above), and the system can then perform those actions/tasks and/or begin preparation for those actions/tasks. For example, the system could detect the brainwaves and/or bodily responses that are recognized (from either active training and/or passive learning) to represent the intention to create a note. The system could then prepare the user's device(s) by automatically unlocking and preopening the note application so that the user can start typing the note without having to first pick up the device, unlock it, find the correct note-taking application, open it, go into the editing mode, and finally begin editing. This can greatly reduce the likelihood that the user forgets what it was that he/she wanted to write down. More generally, the system can detect brainwaves and/or bodily responses that correspond to brainwaves and/or bodily responses previously detected immediately before the user selected a particular application/program. In such case, the system can recognize that the user intends to select such program again, and so the system can automatically do so for the user.
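
One simple, non-limiting way to express this matching of a fresh measurement against previously learned patterns is a nearest-neighbor lookup, sketched below. The feature vectors and intent labels are hypothetical; the disclosure does not mandate this particular technique, which is used here only for illustration.

```python
import math

# Previously recorded brainwave/bodily-response feature vectors, each labeled
# with the action the user went on to perform (hypothetical values).
recorded_patterns = [
    ([0.8, 0.1, 0.3], "create_note"),
    ([0.2, 0.9, 0.4], "open_email"),
    ([0.1, 0.2, 0.9], "idle"),
]

def infer_intent(measurement):
    def distance(a, b):
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))
    _, intent = min(((distance(measurement, vec), label)
                     for vec, label in recorded_patterns), key=lambda t: t[0])
    return intent

# A new measurement close to the "create_note" pattern: the system could then
# unlock the device and pre-open the note-taking application for the user.
print(infer_intent([0.75, 0.15, 0.25]))  # -> "create_note"
```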

Similarly, in various embodiments, the system can measure whether or not such detected intentions are eventually fulfilled. So, if the system detects that the user intends to perform a certain task, the system can determine, and/or query the user regarding, whether that task was actually performed. For example, if the system determines that the user intends to schedule an appointment with person J, the system can wait for a defined amount of time (e.g., a day) to see if the user follows through with such intention. If the defined amount of time elapses (e.g., a day passes) without the system detecting the creation of such an appointment (e.g., via image analysis of on-screen displays of one or more computing devices of the user), the system can notify the user that such appointment has not yet been made, thereby reminding the user. The system can also note (e.g., in the Personal Timeline and/or Lifegraph) that an open-loop has been created (at least until the user responds and/or otherwise fulfills the intention).

In various embodiments, the system/personal AI can, after detecting/determining a user's intent, provide access to the intent on a marketplace (e.g., an online marketplace). This marketplace could allow service providers and/or product providers to directly, and/or via AI conduits, bid on the opportunity to fulfill a user's intention. This can occur automatically and/or upon the user's request. In other words, bidding can occur by which individuals/entities on the marketplace compete (e.g., bid) for an opportunity to fulfill the intent for the user. For example, if a user wishes to purchase a plane ticket to a particular destination on a particular date, the airline service providers on the marketplace can bid (e.g., offer prices) to the user, in hopes of getting the user's business. In some cases, the winning bid can be chosen by the user. In other cases, the winning bidder can get priority in fulfilling the user's intent (e.g., having the winning provider communicate with the user via the system/personal AI to discuss potential prices, and so on). In various embodiments, underbidding/competition between the service/product providers can occur, by which the bidders consecutively decrease their offered prices in the hopes of obtaining the user's business. In such case, the user can select a single winning bid that can be instantly confirmed (e.g., the system/personal AI can automatically begin transacting with the winning provider to fulfill the user's intent). In some cases, the user can be presented with a small set of several top bids (e.g., due to a combination of low price and high quality, and so on).

In some embodiments, reverse bidding can occur by which different users of different computing devices compete (e.g., bid) for an opportunity to have a scarce service/product provider on the marketplace fulfill their intent. In other words, while the normal bidding process described above can involve multiple service/product providers consecutively lowering the prices that they are willing to accept so as to convince a user to purchase from them, reverse bidding can include multiple users consecutively increasing the prices that they are willing to pay so as to convince a service/product provider to sell to them.
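
The contrast between the two bidding directions can be sketched as follows. The provider names, prices, quality values, and user offers are illustrative assumptions, and the selection rules (lowest price wins, highest offer wins) are only one simple possibility.

```python
# Minimal sketch of bid selection for normal bidding and reverse bidding.
provider_bids = [
    {"provider": "Airline A", "price": 420.0, "quality": 0.9},
    {"provider": "Airline B", "price": 395.0, "quality": 0.7},
    {"provider": "Airline C", "price": 410.0, "quality": 0.8},
]

# Normal bidding: providers compete downward; pick the lowest price, or
# present a small set of top bids balancing price and quality.
winning_bid = min(provider_bids, key=lambda b: b["price"])
top_bids = sorted(provider_bids, key=lambda b: (b["price"], -b["quality"]))[:2]

# Reverse bidding: users compete upward for a scarce provider; highest offer wins.
user_offers = [("user_1", 500.0), ("user_2", 540.0), ("user_3", 520.0)]
winning_user = max(user_offers, key=lambda o: o[1])

print(winning_bid["provider"], [b["provider"] for b in top_bids], winning_user[0])
```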

In some cases, limited and/or full access to the user's Lifegraph and/or Personal Timeline can be granted to the entity that is to fulfill the user's intent, thereby allowing the entity to better learn how the user would like their intent fulfilled. For example, the system/personal AI can detect (as described above) that the user would like to purchase tickets to a particular concert/event. Moreover, the Lifegraph can include data indicating that the user has traditionally purchased concert tickets corresponding to seats in the front of the concert hall/stadium. So, the system/personal AI can then inform the marketplace of the user's intent, can search for available concert tickets for seats at the front of the stadium, and can accept (in the case of bidding) and/or bid for (in the case of reverse bidding) a third-party's offer to fulfill the intent (e.g., to obtain the tickets for the front-row seats for the user). In the case of purchasing airline tickets, the Lifegraph of the user can note that the user traditionally prefers aisle seats. So, the system/personal AI can use this data to narrow the marketplace search for a service/product provider (e.g., only searching for airline tickets that have available aisle seats, and so on). In the case of purchasing a car or car insurance, the Lifegraph can include any information pertaining to the user's vehicle that the user may have researched online and/or discussed in an email (e.g., car ID number, dealership, dealership location, price, and so on). So, the system/personal AI can interact with an online service/product provider (e.g., the car dealer's website, the car insurer's website, and so on) and can provide such information from the Lifegraph to the online service/product provider (e.g., pre-filling the data fields in the provider's website with relevant information from the Lifegraph). In various embodiments, the system can even access/query the Lifegraphs and/or Personal Timelines of multiple users (e.g., thousands of users).

In some embodiments, the system/personal AI can allow fulfillment-only data access (e.g., access to the user's Lifegraph and/or Personal Timeline) to the market entity. Because some specific intents can require the input of sensitive/private information (e.g., credit card number, social security number, home address, and so on), the market entity can be granted access to such data of the user (e.g., Lifegraph, Personal Timeline, and so on). However, to prevent misuse of such data, the system/personal AI can encrypt the data and/or otherwise protect the data such that it can be used (e.g., for fulfillment of the intent) without being otherwise read, copied, and/or extracted from the user's Lifegraph, Personal Timeline, and so on.

In one or more embodiments, the Personal Timeline can display all files and/or documents that were downloaded on one or more user devices over any given time interval. Conventionally, the “download folder” on most personal computing devices receives every file downloaded onto the device, regardless of file type, time of download, and so on. To help make sense of this “file dump,” the system can, in various embodiments, include all the downloaded files/documents on the Personal Timeline according to type of file/document, when downloaded, source of the download, relation to certain user intents/actions/tasks, and/or so on. In some cases, the downloaded files/documents can be appended to the Personal Timeline (e.g., similar to annotations). In other cases, the downloaded files/documents can be displayed on a dedicated “downloaded files” view of the Personal Timeline. For example, the user could search the Personal Timeline for all downloaded files/documents according to any kind of search criteria (e.g., all files that were downloaded within the past two weeks, all files that were downloaded from a particular webpage, all files that were downloaded while the user was at a particular location, and so on). In some embodiments, the interrelationships between these various downloaded files can be represented visually in a manner similar to the Lifegraph (e.g., a web of interconnected icons representing the files, and so on). Those of skill in the art will appreciate that the Personal Timeline can be used in this manner for any collection of files, and not solely for downloaded files.
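
A minimal sketch of such a query over downloaded-file annotations is given below. The record fields (when, source, location) and the sample data are assumptions about how such annotations might be stored; the search criteria mirror those listed above.

```python
from datetime import datetime, timedelta

# Hypothetical download annotations appended to the Personal Timeline.
downloads = [
    {"file": "lab_data.csv", "when": datetime(2018, 3, 1),  "source": "lab-portal.example.edu", "location": "library"},
    {"file": "syllabus.pdf", "when": datetime(2018, 1, 10), "source": "university.example.edu", "location": "home"},
    {"file": "receipt.pdf",  "when": datetime(2018, 3, 5),  "source": "airline.example.com",    "location": "park"},
]

def search_downloads(records, since=None, source=None, location=None):
    hits = records
    if since is not None:
        hits = [r for r in hits if r["when"] >= since]
    if source is not None:
        hits = [r for r in hits if source in r["source"]]
    if location is not None:
        hits = [r for r in hits if r["location"] == location]
    return hits

# e.g., "all files downloaded in the past two weeks" relative to a reference date.
two_weeks_before = datetime(2018, 3, 7) - timedelta(weeks=2)
print([r["file"] for r in search_downloads(downloads, since=two_weeks_before)])
```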

In one or more embodiments, the Personal Timeline and/or Lifegraph of the user can be stored on computer-readable storage media and/or in the cloud and can be downloaded from and/or onto one or more user devices. As explained above, the Personal Timeline and/or Lifegraph of a user can transcend any particular computing device. Although the Personal Timeline and/or Lifegraph can represent the comprehensive on-screen and/or off-screen computing history of a given computing device (e.g., laptop, desktop, personal digital assistant, smart phone, and so on), it can, in some embodiments, represent the comprehensive on-screen and/or off-screen computing history of the user himself/herself (e.g., of all the smart devices and/or sensors that the user uses, including laptop, desktop, personal digital assistant, smart phone, game console, bio-sensors, smart house, smart car, smart appliances, and so on). In such case, the user's Personal Timeline and/or Lifegraph can be considered as a user profile and/or user account that can contain any and/or all computing data pertaining to the user and which can be transferred and/or shared between computing devices. In such embodiments, various computing devices and/or applications/programs that request user accounts/profiles can be satisfied by receiving a copy of the user's Personal Timeline and/or Lifegraph.

In various embodiments, the system/personal AI can extract and/or identify a core set of relevant variables out of a multi-variable system displayed on the computing screen, thereby further improving the ability of the personal AI to learn and/or otherwise interact with the displayed program/webpage. Because, in some cases, not all the possible variables carry substantive importance to the user's actions on the computing device(s), the system/personal AI can ignore such minimally important variables and instead focus on learning the most/more important ones. For example, a particular webpage (e.g., Facebook page) can have a plethora of variables and/or clickable links on it (e.g., a post button, a delete button, an edit button, a like button, a comment button, a share button, an add-friend button, various hyperlink buttons, a settings button, a marketplace button, a camera/photos button, an “about” button, a messages button, and so on). The system/personal AI can, in some embodiments, be explicitly programmed (e.g., hardcoded) to detect and/or pay attention to only a select subset of those many variables. In other embodiments, the system/personal AI can infer, based on watching which variables the user routinely interacts with, which variables are relevant and which are not.
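
The inference variant can be sketched simply as interaction-frequency counting, as below. The control names, the interaction log, and the cutoff are illustrative assumptions; the disclosed system may of course weigh variables in other ways.

```python
from collections import Counter

# Hypothetical log of which on-screen controls the user interacted with.
interaction_log = ["post_button", "like_button", "post_button", "comment_button",
                   "post_button", "like_button", "settings_button", "post_button"]

counts = Counter(interaction_log)
MIN_INTERACTIONS = 2  # controls used fewer times are treated as minimally important

relevant = {name for name, n in counts.items() if n >= MIN_INTERACTIONS}
ignored = set(counts) - relevant

print("learn/track:", relevant)  # e.g., post_button, like_button
print("ignore:", ignored)        # e.g., comment_button, settings_button
```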

In one or more embodiments, the system can be implemented to comprehensively record employee actions and/or computing histories. For example, an employer can require his/her employees to use computing devices outfitted with one or more embodiments of the disclosed system. In such case, each employee's computing behavior (e.g., on-screen and, in some cases, off-screen) can be comprehensively recorded and stored for later review. This can enable the employer to reliably monitor his/her employees (e.g., to search for slackers, to provide record evidence in pay and/or labor disputes, and so on), which can otherwise be daunting and/or nearly impossible if very large numbers of employees require monitoring. Moreover, the system can analyze such employee records to compile a list of notable computing actions/tasks (e.g., Which tasks are the most commonly performed by the employees overall?; Which tasks are the most/least time-consuming overall?; Which task patterns expose the company to the least cyber security risk?; and so on). By compiling this information and making it readily available to the employer, the employer can determine how to streamline and/or otherwise improve his/her employees' computing behaviors. For example, the system can determine that most employees have to constantly/frequently open a particular folder, select a particular file, copy information from that particular file, and paste it into a different particular file. In such case, the employer can decide to streamline the process for the employees by creating pre-filled forms that already have the requisite information copied therein (thereby eliminating the copy/paste steps) and/or to otherwise make a shortcut to the copy/paste file (thereby reducing the time it takes to complete the copy/paste task), and so on.

The following figures illustrate potential instantiations of the above-described embodiments. FIG. 4 illustrates an example dashboard interface 400 in accordance with system 100. Section 402 of the interface illustrates a time series chart of actions (e.g., opened up Chrome, checked stock market prices, zoomed in on an image, handled email, engaged in a Skype call, and so on) taken by a user in connection with computing interaction. A gaming component interface 404 represents status of training of the model 110 and associated functionalities achieved as well as path to achieving a comprehensively trained model (e.g., level of evolving). Section 406 of the dashboard provides interface(s) for training the model 110, where a user can explicitly input (e.g., type, verbal, gesture, and so on) intentions, and the model 110 can interact with the user by asking questions or seeking confirmation regarding determined or inferred intent. Users can specify intentions or actions through dialog boxes and/or via voice commands. At section 410, a user can input tasks into a task box for the model 110 to facilitate executing, e.g., autonomously or partially. At 412, the user can confirm completion of the task, rate the performance, re-task, and so on.

FIG. 5 illustrates a detailed example of the dashboard time-line 402 with respective identified actions presented in time series, and progress of training of the gaming component interface. Section 406 is illustrated in a manner where intentions can be labeled and explicitly provided with deep granularity.

FIG. 6 illustrates an interface where the system 100 is learning intentions of a user, and can categorize intentions in the form of a library of intentions to draw upon in connection with determining, predicting or inferring intentions and taking action based on such inference, prediction or determination. A catalogue of collected screens is kept to recalculate and refine intentions as the algorithm quality improves. Additionally, similarity and confidence scores can be displayed here.

FIG. 7 illustrates an interface showing status of the gaming component, including tasks that can be accomplished by the system 100. The interface also shows a visual representation of the AI, the productivity score representing the time gained by having GO take over tasks, and gaming components showing the user's personal score and the user's current rank on the leaderboard. Current tasks being performed by the AI are shown with current estimates for turnover/completion times. A specific status workflow shows each task's position within the process, from learning to completion.

FIG. 8 illustrates an example, non-limiting methodology 800 in connection with an implementation to facilitate determining or inferring user intent. For simplicity of explanation, the computer-implemented method(s) are depicted and described as a series of acts. It is to be understood and appreciated that the subject innovation is not limited by the acts illustrated and/or by the order of acts; for example, acts can occur in various orders and/or concurrently, and with other acts not presented and described herein. Furthermore, not all illustrated acts can be required to implement the computer-implemented methodologies in accordance with the disclosed subject matter. In addition, those skilled in the art will understand and appreciate that the computer-implemented methodologies could alternatively be represented as a series of interrelated states via a state diagram or events. Additionally, it should be further appreciated that the computer-implemented methodologies disclosed hereinafter and throughout this specification are capable of being stored on an article of manufacture to facilitate transporting and transferring such computer-implemented methodologies to computers. The term article of manufacture, as used herein, is intended to encompass a computer program accessible from any computer-readable device or storage media.

At 802, the methodology receives (e.g., using an input component) time-series data regarding state of a subset of influencers (e.g., screen captures of a display, audio information, motion information, location information, optical character recognition (OCR) information, video information, 360°/panoramic recordings, and so on) with respect to user interaction in a computing environment. The time-series data is analyzed and employed at 804 to train a model in connection with determining, predicting or inferring a user's intentions. At 806, the method receives or solicits user input regarding intent or confirmation of intent in connection with further training the model, e.g., implicitly or explicitly. At 808, a gamification component is displayed to the user representing level of training of the model and capabilities or functionalities associated with the model as currently trained. In an aspect, the gamification component information can be shared with third parties to facilitate sharing of training status. At 810, the model can take automated action based on its training and determined, predicted or inferred intent while employing a utility-based analysis where the benefits of taking a correct automated action are weighed against the costs associated with taking incorrect automated action.
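
The utility-based analysis at act 810 can be sketched as a simple expected-utility comparison, as below. The probabilities, benefit, and cost values are illustrative assumptions, not figures from the disclosure.

```python
# Minimal sketch of a utility-based decision: automate only when the expected
# benefit outweighs the expected cost of acting incorrectly.
def should_automate(p_intent_correct: float, benefit_if_correct: float,
                    cost_if_wrong: float) -> bool:
    expected_utility = (p_intent_correct * benefit_if_correct
                        - (1.0 - p_intent_correct) * cost_if_wrong)
    return expected_utility > 0.0

# High-confidence, low-risk action (e.g., pre-opening a note app): automate.
print(should_automate(p_intent_correct=0.9, benefit_if_correct=1.0, cost_if_wrong=2.0))   # True
# Low-confidence, high-risk action (e.g., submitting payment information): do not.
print(should_automate(p_intent_correct=0.6, benefit_if_correct=1.0, cost_if_wrong=20.0))  # False
```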

Turning now to FIGS. 9A-9J, an embodiment of the innovation provides for viewing computing device interaction done in the past in a video-like format. Past device interaction can be viewed across one or more devices (e.g., laptop, desktop, cell phone, tablet, television, gaming console, and so on). Thus, some, most or all computing device interaction can be recorded, indexed and archived in a video format for subsequent search, retrieval and viewing. In some embodiments, various metadata (whether or not enriched) can be multiplexed into and/or otherwise attached to the recorded, indexed, and archived video file to correlate such metadata with the user's on-screen actions. As described above, such metadata can include audio, location, bio-data and so on. Such a feature allows for a user to see in a visual manner his/her computing device interaction for a significant and/or indefinite span of time. This can be one way of facilitating the Personal Timeline and/or visual history, described above.

Moreover, all this data and/or metadata can be cached locally but held in a cloud computing environment. The data can be encrypted and/or unencrypted. Additionally, the data can be continuously (and/or less frequently) manipulated, analyzed, and/or examined, and can be subsequently put to use to fulfill user intents, as described above. Furthermore, a single system can collect and/or otherwise be associated with data for multiple users. In such case, the system/personal AIs can study the data sets of the multiple users (e.g., which data sets can be stored together in the cloud) so as to efficiently learn/extrapolate abstract task patterns (e.g., learning general patterns regarding sending an email, opening an internet browser, and so on), as described above.

FIG. 9A illustrates a mouse pointer selecting a movie button (M) which generates a slide bar shown below with a set of video thumbnails that represent portions of video files of user computing device interaction across a range of time. The thumbnails allow the user to visually glance through moments in his/her computing device interaction and quickly find a state of interest. Above the set of thumbnails is a scroll bar with a circle slider that allows for the user to scroll across an entire range of time, and as the user is scrolling through different time ranges the thumbnails automatically update to reflect the time range selected by the user. As discussed above, dirty rectangle methods can be implemented to determine a level of impact/importance/relevance of particular changes in display screens (e.g., a change in a displayed clock is not substantively important, whereas a change in a subject line of an email is probably substantively important). Thus, multiple screens and/or video portions/segments containing unimportant changes can be represented by a single thumbnail image, so as to not overwhelm the user with redundant information.
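
A minimal sketch of collapsing runs of substantively unimportant frames into a single thumbnail is shown below. Each frame is reduced here to a timestamp and an importance flag, which is only a stand-in for the dirty-rectangle importance determination described above.

```python
# Hypothetical per-frame importance flags, as might result from dirty-rectangle
# analysis of the changed screen regions.
frames = [
    ("09:00:00", True),   # email subject line changed -> important
    ("09:00:05", False),  # only the on-screen clock changed
    ("09:00:10", False),  # only the on-screen clock changed
    ("09:00:15", True),   # new document window opened -> important
    ("09:00:20", False),
]

thumbnails = []
for timestamp, important in frames:
    if important or not thumbnails:
        thumbnails.append(timestamp)  # start a new thumbnail
    # otherwise the frame is folded into the preceding thumbnail

print(thumbnails)  # ['09:00:00', '09:00:15'] -- redundant frames are not shown separately
```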

Turning to FIG. 9B, a search tool on the left of the screen (FIG. 9A) can be initiated that opens a calendar where a user can select one or more dates of interest to facilitate converging on a set of video files of potential interest. Additionally, as shown in FIG. 9C, the user can further filter the search by application used (e.g., Facebook, Twitter, Word, PowerPoint, Google Search, and so on) and/or device used (e.g., desktop computer, laptop, smart phone, tablet, and so on). FIG. 9D illustrates further filtering the search using keywords, tags or the like. It is to be appreciated that voice-based commands and/or natural language searching or the like can be employed to converge on computing device interactions of interest. Additionally, extrinsic or contextual information that was collected and stored could also be utilized in the search. For example, the user may have recalled being on his/her cell phone in a park and it was raining that day. The system could utilize GPS tags, weather information, and so on to determine the location, date and time when the user was in that context and then quickly provide the recorded video files of the computing device interaction. As explained above, this can be performed by using focus objects/sections (e.g., important on-screen markers/changes that are recognizable by the system/personal AI). That is, the search can be multifaceted, including search criteria such as on which device the search was performed, where the search was performed, when the search was performed, what the weather was like when the search was performed, and so on.

FIG. 9E illustrates a feature where the user can select a particular section of a video file. The user can export, save, transmit or create a private or public link for such section of video as shown in FIG. 9F. As discussed above, various time segments/clusters can be created, where each segment/cluster pertains to a certain substantive goal/action by the user. These clusters/segments can be used to effectively filter through the Personal Timeline/visual history (e.g., only video segments pertaining to a particular substantive goal/task can be shown, thereby not forcing the user to wade through irrelevant time segments/clusters). Another aspect provides for the user to select a portion of the video screen and drag and drop it as shown in FIG. 9G. FIG. 9H illustrates an aspect that provides for selecting application(s) activity to be ignored. FIG. 9I depicts providing the user with various storage options. For example, price of storage can be based on amount of storage space required (e.g., one price for 500 gigabytes) as well as amount of time stored (e.g., one price for 6 months of storage). FIG. 9J depicts an embodiment on a cell phone. Data streams across different devices can be interleaved so that they can be provided with time series computing device interactions as well as knowledge of which device was used and when. Accordingly, user context (e.g., focus of attention, how distracted, frequency of context switching, and so on) can be determined. The video storage is highly dynamic and can record at very granular levels (e.g., every single keystroke); compression techniques can be leveraged to optimize storage utilization. The video files and system can reside in a cloud, on one or more devices, and in a distributed framework for ease of access. Thus, the system allows for a user's entire set of computing device interactions to be stored and easily retrieved and viewed; a multi-layered data stream is generated. User interactions can be stored in a specific track, windows positioning can be stored in a different track, location and/or device can be other tracks or layers that can be stacked, and so on. The analysis component 114 can analyze and learn these layers. In an embodiment, less than 50 GB of storage is needed to store an entire year of user computing device interactions. CPU usage can likewise be kept minimal by implementing low-level hardware functions and smart pipe-filling and handling. In an aspect, each and every video frame can be analyzed, indexed and learned to a pixel or text level to provide for a highly extensible and searchable platform for reviewing past computing device interaction.
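
One simple, non-limiting way to represent such a multi-layered data stream is as a set of time-stamped tracks that can be stacked and queried together, sketched below. The track names and sample entries are hypothetical.

```python
# Minimal sketch of a multi-track timeline: each kind of information is kept
# in its own time-stamped track (layer), and layers can be queried together.
timeline = {
    "user_interactions": [(1.0, "keystroke:a"), (2.5, "click:save_button")],
    "window_positions":  [(0.0, {"Word": (0, 0, 800, 600)})],
    "device":            [(0.0, "laptop"), (3.0, "phone")],
    "location":          [(0.0, "office")],
}

def events_between(track_name: str, start: float, end: float):
    """Return the entries of one track that fall within a time window."""
    return [(t, event) for t, event in timeline[track_name] if start <= t <= end]

# e.g., reconstruct what the user did in the first two seconds, and on which device.
print(events_between("user_interactions", 0.0, 2.0))
print(events_between("device", 0.0, 2.0))
```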

The system 100 can learn and identify communication events, completion events, entertainment events, and so on. In an embodiment, the system 100 can learn user behavior and identify deficient or undesirable actions and provide feedback to the user to improve efficiency.

In an aspect, audio and transcription of the audio can be added as layers, and such information can be searchable as well.

In an embodiment, employers can leverage such functionality for monitoring employee work performance, determining cause of errors, improving workflow, security, and so on, thereby providing substantially continuous/constant monitoring of employees.

In an embodiment, the analysis component 114 in conjunction with model generation component 108 can learn user behavior and provide recommendations to optimize future computing device interaction.

The subject embodiment provides for a life recorder that can continuously run in the background with low processing load and low storage consumption, that can store, tag, and index computing device interaction in video format, and that provides an elegant tool to quickly retrieve and view past computing device interaction.

In order to provide a context for the various aspects of the disclosed subject matter, FIG. 10 as well as the following discussion are intended to provide a general description of a suitable environment in which the various aspects of the disclosed subject matter can be implemented. FIG. 10 illustrates a block diagram of an example, non-limiting operating environment in which one or more embodiments described herein can be facilitated. Repetitive description of like elements employed in other embodiments described herein is omitted for sake of brevity.

With reference to FIG. 10, a suitable operating environment 1000 for implementing various aspects of this disclosure can also include a computer 1012. The computer 1012 can also include a processing unit 1014, a system memory 1016, and a system bus 1018. The system bus 1018 couples system components including, but not limited to, the system memory 1016 to the processing unit 1014. The processing unit 1014 can be any of various available processors. Dual microprocessors and other multiprocessor architectures also can be employed as the processing unit 1014. The system bus 1018 can be any of several types of bus structure(s) including the memory bus or memory controller, a peripheral bus or external bus, and/or a local bus using any variety of available bus architectures including, but not limited to, Industrial Standard Architecture (ISA), Micro-Channel Architecture (MCA), Extended ISA (EISA), Intelligent Drive Electronics (IDE), VESA Local Bus (VLB), Peripheral Component Interconnect (PCI), Card Bus, Universal Serial Bus (USB), Advanced Graphics Port (AGP), Firewire (IEEE 1394), and Small Computer Systems Interface (SCSI).

The system memory 1016 can also include volatile memory 1020 and nonvolatile memory 1022. The basic input/output system (BIOS), containing the basic routines to transfer information between elements within the computer 1012, such as during start-up, is stored in nonvolatile memory 1022. Computer 1012 can also include removable/non-removable, volatile/non-volatile computer storage media. FIG. 10 illustrates, for example, a disk storage 1024. Disk storage 1024 can also include, but is not limited to, devices like a magnetic disk drive, floppy disk drive, tape drive, Jaz drive, Zip drive, LS-100 drive, flash memory card, or memory stick. The disk storage 1024 also can include storage media separately or in combination with other storage media. To facilitate connection of the disk storage 1024 to the system bus 1018, a removable or non-removable interface is typically used, such as interface 1026. FIG. 10 also depicts software that acts as an intermediary between users and the basic computer resources described in the suitable operating environment 1000. Such software can also include, for example, an operating system 1028. Operating system 1028, which can be stored on disk storage 1024, acts to control and allocate resources of the computer 1012.

System applications 1030 take advantage of the management of resources by operating system 1028 through program modules 1032 and program data 1034, e.g., stored either in system memory 1016 or on disk storage 1024. It is to be appreciated that this disclosure can be implemented with various operating systems or combinations of operating systems. A user enters commands or information into the computer 1012 through input device(s) 1036. Input devices 1036 include, but are not limited to, a pointing device such as a mouse, trackball, stylus, touch pad, keyboard, microphone, joystick, game pad, satellite dish, scanner, TV tuner card, digital camera, digital video camera, web camera, and the like. These and other input devices connect to the processing unit 1014 through the system bus 1018 via interface port(s) 1038. Interface port(s) 1038 include, for example, a serial port, a parallel port, a game port, and a universal serial bus (USB). Output device(s) 1040 use some of the same type of ports as input device(s) 1036. Thus, for example, a USB port can be used to provide input to computer 1012, and to output information from computer 1012 to an output device 1040. Output adapter 1042 is provided to illustrate that there are some output devices 1040 like monitors, speakers, and printers, among other output devices 1040, which require special adapters. The output adapters 1042 include, by way of illustration and not limitation, video and sound cards that provide a means of connection between the output device 1040 and the system bus 1018. It should be noted that other devices and/or systems of devices provide both input and output capabilities such as remote computer(s) 1044.

Computer 1012 can operate in a networked environment using logical connections to one or more remote computers, such as remote computer(s) 1044. The remote computer(s) 1044 can be a computer, a server, a router, a network PC, a workstation, a microprocessor based appliance, a peer device or other common network node and the like, and typically can also include many or all of the elements described relative to computer 1012. For purposes of brevity, only a memory storage device 1046 is illustrated with remote computer(s) 1044. Remote computer(s) 1044 is logically connected to computer 1012 through a network interface 1048 and then physically connected via communication connection 1050. Network interface 1048 encompasses wire and/or wireless communication networks such as local-area networks (LAN), wide-area networks (WAN), cellular networks, etc. LAN technologies include Fiber Distributed Data Interface (FDDI), Copper Distributed Data Interface (CDDI), Ethernet, Token Ring and the like. WAN technologies include, but are not limited to, point-to-point links, circuit switching networks like Integrated Services Digital Networks (ISDN) and variations thereon, packet switching networks, and Digital Subscriber Lines (DSL). Communication connection(s) 1050 refers to the hardware/software employed to connect the network interface 1048 to the system bus 1018. While communication connection 1050 is shown for illustrative clarity inside computer 1012, it can also be external to computer 1012. The hardware/software for connection to the network interface 1048 can also include, for exemplary purposes only, internal and external technologies such as modems including regular telephone grade modems, cable modems and DSL modems, ISDN adapters, and Ethernet cards.

The present invention may be a system, a method, an apparatus and/or a computer program product at any possible technical detail level of integration. The computer program product can include a computer readable storage medium (or media) having computer readable program instructions thereon for causing a processor to carry out aspects of the present invention. The computer readable storage medium can be a tangible device that can retain and store instructions for use by an instruction execution device. The computer readable storage medium can be, for example, but is not limited to, an electronic storage device, a magnetic storage device, an optical storage device, an electromagnetic storage device, a semiconductor storage device, or any suitable combination of the foregoing. A non-exhaustive list of more specific examples of the computer readable storage medium can also include the following: a portable computer diskette, a hard disk, a random access memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or Flash memory), a static random access memory (SRAM), a portable compact disc read-only memory (CD-ROM), a digital versatile disk (DVD), a memory stick, a floppy disk, a mechanically encoded device such as punch-cards or raised structures in a groove having instructions recorded thereon, and any suitable combination of the foregoing. A computer readable storage medium, as used herein, is not to be construed as being transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission media (e.g., light pulses passing through a fiber-optic cable), or electrical signals transmitted through a wire.

Computer readable program instructions described herein can be downloaded to respective computing/processing devices from a computer readable storage medium or to an external computer or external storage device via a network, for example, the Internet, a local area network, a wide area network and/or a wireless network. The network can comprise copper transmission cables, optical transmission fibers, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers. A network adapter card or network interface in each computing/processing device receives computer readable program instructions from the network and forwards the computer readable program instructions for storage in a computer readable storage medium within the respective computing/processing device. Computer readable program instructions for carrying out operations of the present invention can be assembler instructions, instruction-set-architecture (ISA) instructions, machine instructions, machine dependent instructions, microcode, firmware instructions, state-setting data, configuration data for integrated circuitry, or either source code or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, C++, or the like, and procedural programming languages, such as the “C” programming language or similar programming languages. The computer readable program instructions can execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the latter scenario, the remote computer can be connected to the user's computer through any type of network, including a local area network (LAN) or a wide area network (WAN), or the connection can be made to an external computer (for example, through the Internet using an Internet Service Provider). In some embodiments, electronic circuitry including, for example, programmable logic circuitry, field-programmable gate arrays (FPGA), or programmable logic arrays (PLA) can execute the computer readable program instructions by utilizing state information of the computer readable program instructions to personalize the electronic circuitry, in order to perform aspects of the present invention.

Aspects of the present invention are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer readable program instructions. These computer readable program instructions can be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. These computer readable program instructions can also be stored in a computer readable storage medium that can direct a computer, a programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer readable storage medium having instructions stored therein comprises an article of manufacture including instructions which implement aspects of the function/act specified in the flowchart and/or block diagram block or blocks. The computer readable program instructions can also be loaded onto a computer, other programmable data processing apparatus, or other device to cause a series of operational acts to be performed on the computer, other programmable apparatus or other device to produce a computer implemented process, such that the instructions which execute on the computer, other programmable apparatus, or other device implement the functions/acts specified in the flowchart and/or block diagram block or blocks.

The flowchart and block diagrams in the Figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods, and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams can represent a module, segment, or portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s). In some alternative implementations, the functions noted in the blocks can occur out of the order noted in the Figures. For example, two blocks shown in succession can, in fact, be executed substantially concurrently, or the blocks can sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems that perform the specified functions or acts or carry out combinations of special purpose hardware and computer instructions.

While the subject matter has been described above in the general context of computer-executable instructions of a computer program product that runs on a computer and/or computers, those skilled in the art will recognize that this disclosure also can be implemented in combination with other program modules. Generally, program modules include routines, programs, components, data structures, etc., that perform particular tasks and/or implement particular abstract data types. Moreover, those skilled in the art will appreciate that the inventive computer-implemented methods can be practiced with other computer system configurations, including single-processor or multiprocessor computer systems, mini-computing devices, mainframe computers, as well as computers, hand-held computing devices (e.g., PDA, phone), microprocessor-based or programmable consumer or industrial electronics, and the like. The illustrated aspects can also be practiced in distributed computing environments in which tasks are performed by remote processing devices that are linked through a communications network. However, some, if not all aspects of this disclosure can be practiced on stand-alone computers. In a distributed computing environment, program modules can be located in both local and remote memory storage devices.

As used in this application, the terms “component,” “system,” “platform,” “interface,” and the like, can refer to and/or can include a computer-related entity or an entity related to an operational machine with one or more specific functionalities. The entities disclosed herein can be either hardware, a combination of hardware and software, software, or software in execution. For example, a component can be, but is not limited to being, a process running on a processor, a processor, an object, an executable, a thread of execution, a program, and/or a computer. By way of illustration, both an application running on a server and the server can be a component. One or more components can reside within a process and/or thread of execution and a component can be localized on one computer and/or distributed between two or more computers. In another example, respective components can execute from various computer readable media having various data structures stored thereon. The components can communicate via local and/or remote processes such as in accordance with a signal having one or more data packets (e.g., data from one component interacting with another component in a local system, distributed system, and/or across a network such as the Internet with other systems via the signal). As another example, a component can be an apparatus with specific functionality provided by mechanical parts operated by electric or electronic circuitry, which is operated by a software or firmware application executed by a processor. In such a case, the processor can be internal or external to the apparatus and can execute at least a part of the software or firmware application. As yet another example, a component can be an apparatus that provides specific functionality through electronic components without mechanical parts, wherein the electronic components can include a processor or other means to execute software or firmware that confers at least in part the functionality of the electronic components. In an aspect, a component can emulate an electronic component via a virtual machine, e.g., within a cloud computing system.

In addition, the term “or” is intended to mean an inclusive “or” rather than an exclusive “or.” That is, unless specified otherwise, or clear from context, “X employs A or B” is intended to mean any of the natural inclusive permutations. That is, if X employs A; X employs B; or X employs both A and B, then “X employs A or B” is satisfied under any of the foregoing instances. Moreover, articles “a” and “an” as used in the subject specification and annexed drawings should generally be construed to mean “one or more” unless specified otherwise or clear from context to be directed to a singular form. As used herein, the terms “example” and/or “exemplary” are utilized to mean serving as an example, instance, or illustration. For the avoidance of doubt, the subject matter disclosed herein is not limited by such examples. In addition, any aspect or design described herein as an “example” and/or “exemplary” is not necessarily to be construed as preferred or advantageous over other aspects or designs, nor is it meant to preclude equivalent exemplary structures and techniques known to those of ordinary skill in the art.

As it is employed in the subject specification, the term “processor” can refer to substantially any computing processing unit or device comprising, but not limited to, single-core processors; single-processors with software multithread execution capability; multi-core processors; multi-core processors with software multithread execution capability; multi-core processors with hardware multithread technology; parallel platforms; and parallel platforms with distributed shared memory. Additionally, a processor can refer to an integrated circuit, an application specific integrated circuit (ASIC), a digital signal processor (DSP), a field programmable gate array (FPGA), a programmable logic controller (PLC), a complex programmable logic device (CPLD), a discrete gate or transistor logic, discrete hardware components, or any combination thereof designed to perform the functions described herein. Further, processors can exploit nano-scale architectures such as, but not limited to, molecular and quantum-dot based transistors, switches and gates, in order to optimize space usage or enhance performance of user equipment. A processor can also be implemented as a combination of computing processing units. In this disclosure, terms such as “store,” “storage,” “data store,” “data storage,” “database,” and substantially any other information storage component relevant to operation and functionality of a component are utilized to refer to “memory components,” entities embodied in a “memory,” or components comprising a memory. It is to be appreciated that memory and/or memory components described herein can be either volatile memory or nonvolatile memory, or can include both volatile and nonvolatile memory. By way of illustration, and not limitation, nonvolatile memory can include read only memory (ROM), programmable ROM (PROM), electrically programmable ROM (EPROM), electrically erasable ROM (EEPROM), flash memory, or nonvolatile random access memory (RAM) (e.g., ferroelectric RAM (FeRAM)). Volatile memory can include RAM, which can act as external cache memory, for example. By way of illustration and not limitation, RAM is available in many forms such as synchronous RAM (SRAM), dynamic RAM (DRAM), synchronous DRAM (SDRAM), double data rate SDRAM (DDR SDRAM), enhanced SDRAM (ESDRAM), Synchlink DRAM (SLDRAM), direct Rambus RAM (DRRAM), direct Rambus dynamic RAM (DRDRAM), and Rambus dynamic RAM (RDRAM). Additionally, the disclosed memory components of systems or computer-implemented methods herein are intended to include, without being limited to including, these and any other suitable types of memory.

What has been described above includes mere examples of systems and computer-implemented methods. It is, of course, not possible to describe every conceivable combination of components or computer-implemented methods for purposes of describing this disclosure, but one of ordinary skill in the art can recognize that many further combinations and permutations of this disclosure are possible. Furthermore, to the extent that the terms “includes,” “has,” “possesses,” and the like are used in the detailed description, claims, appendices and drawings, such terms are intended to be inclusive in a manner similar to the term “comprising” as “comprising” is interpreted when employed as a transitional word in a claim.

The descriptions of the various embodiments have been presented for purposes of illustration, but are not intended to be exhaustive or limited to the embodiments disclosed. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the described embodiments. The terminology used herein was chosen to best explain the principles of the embodiments, the practical application or technical improvement over technologies found in the marketplace, or to enable others of ordinary skill in the art to understand the embodiments disclosed herein.

What is claimed is:
1. A system to facilitate determining or inferring user intent, comprising: a processor that executes computer implemented components stored in memory; a machine learning model trained on analysis of graphical image data associated with screen display to determine or infer user intent; an input component that receives image data regarding a screen display associated with user interaction with a computing device; an analysis component that employs the model to determine or infer user intent based on the image data analysis; and an action component that provisions services to the user as a function of the determined or inferred user intent.
2. The system of claim 1, further comprising a gaming component that gamifies interaction with the user in connection with explicitly training the model.
3. The system of claim 1, further comprising a reward component that exposes functionality associated with the trained model as a function of user training efforts.
4. The system of claim 1, further comprising an interactive component that shares status of training of the model with other entities.
5. The system of claim 1, further comprising an avatar component that generates an avatar that interacts with the user based in part on the model and the action component.
6. The system of claim 1, wherein the model employs a recursive learning algorithm or backward propagation of learning across other models or continuous learning algorithm.
7. The system of claim 1, wherein the model learns impact of respective images and directs the action component to revise actions as a function of the learned impact.
8. The system of claim 7, further comprising an optimization component that generates inferences, based on the model, regarding potential points of failure, weakness or bottlenecks in connection with taking automated action.
9. The system of claim 8, further comprising a scheduling component that employs the trained model to schedule activities for the user.
10. The system of claim 2, wherein the gaming component displays progress of training of the model.
 11. The system of claim 10, wherein the gaming component displays exposed functionalities of the trained model.
12. The system of claim 1, further comprising a training component that generates an interface for explicitly training the model by the user.
13. The system of claim 12, wherein the training component questions the user regarding accuracy of determinations or inferences generated by the model.
14. The system of claim 1, wherein the model identifies two or more applications running concurrently on the screen, and determines or infers user intent regarding the respective applications.
15. The system of claim 1, wherein the model analyzes multiple sets of screen image data in a time series manner.
16. A user intent prediction method, comprising: using a processor that executes computer implemented components stored in memory to perform the following acts: using a machine learning model trained on analysis of graphical image data associated with screen display to determine or infer user intent; using an input component to receive image data regarding a screen display associated with user interaction with a computing device; using an analysis component that employs the model to determine or infer user intent based on the image data analysis; and using an action component that provisions services to the user as a function of the determined or inferred user intent.
17. The method of claim 16, further comprising employing a recursive learning algorithm or backward propagation of learning across other models or continuous learning algorithm.
18. The method of claim 16, further comprising using a gaming component to gamify interaction with the user in connection with explicitly training the model.
19. The method of claim 16, further comprising learning impact of respective images and directing an action component to revise actions as a function of the learned impact.
20. A computer readable storage device comprising instructions that, in response to execution, cause a system comprising a processor to perform operations, comprising: using a machine learning model trained on analysis of graphical image data associated with screen display to determine or infer user intent; using an input component to receive image data regarding a screen display associated with user interaction with a computing device; using an analysis component that employs the model to determine or infer user intent based on the image data analysis; and using an action component that provisions services to the user as a function of the determined or inferred user intent.
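
For purposes of illustration only, and not as part of the claims, the following is a minimal sketch of how the elements recited in claim 1 (a trained model, an input component, an analysis component, and an action component) might be organized in code. The sketch is written in Python; every class, function, and label in it (ScreenImage, IntentModel, InputComponent, AnalysisComponent, ActionComponent, and the placeholder intent names) is hypothetical and chosen only for readability, and the trivial prediction rule stands in for an actual trained machine learning model.

# Illustrative, non-limiting sketch; all names and interfaces are hypothetical.
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class ScreenImage:
    """Raw image data captured from a screen display (placeholder: pixel bytes)."""
    pixels: bytes
    timestamp: float


class IntentModel:
    """Stand-in for a machine learning model trained on screen image analysis.

    A real embodiment would run inference against a model trained on labeled
    screen captures; here a trivial rule maps image size to an intent label.
    """
    def predict(self, image: ScreenImage) -> str:
        return "compose_document" if len(image.pixels) % 2 == 0 else "browse_web"


class InputComponent:
    """Receives image data regarding a screen display (claim 1: input component)."""
    def receive(self, pixels: bytes, timestamp: float) -> ScreenImage:
        return ScreenImage(pixels=pixels, timestamp=timestamp)


class AnalysisComponent:
    """Employs the model to determine or infer user intent (claim 1: analysis component)."""
    def __init__(self, model: IntentModel) -> None:
        self.model = model

    def infer_intent(self, image: ScreenImage) -> str:
        return self.model.predict(image)


class ActionComponent:
    """Provisions services as a function of the inferred intent (claim 1: action component)."""
    def __init__(self) -> None:
        self.services: Dict[str, Callable[[], str]] = {
            "compose_document": lambda: "launch grammar-assist service",
            "browse_web": lambda: "launch research-summary service",
        }

    def provision(self, intent: str) -> str:
        return self.services.get(intent, lambda: "no service provisioned")()


if __name__ == "__main__":
    # Wire the components together in the order claim 1 recites them.
    input_component = InputComponent()
    analysis = AnalysisComponent(IntentModel())
    action = ActionComponent()

    image = input_component.receive(pixels=b"\x00" * 64, timestamp=0.0)
    intent = analysis.infer_intent(image)
    print(intent, "->", action.provision(intent))

In an actual embodiment, the trivial rule in IntentModel.predict would be replaced by inference against a model trained on screen image data, and ActionComponent would provision real services rather than returning descriptive strings.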